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A. predicted cDNA sequence of AtFtn2 (SEQ ID NO:l) 
(synonym: At5g42480; synonym: ARC6) gene 

Sequence length = 24 06 nt 

Start codon (ATG) is at position 1-3 

Stop codon (TAA) is at position 2404-2406 

1 ATGGAAGCTC TGAGTCACGT CGGCATTGGT CTCTCCCCAT TCCAATTATG CCGATTACCA 
61 CCGGCGACGA CAAAGCTCCG ACGTAGCCAC AACACCTCTA CAACTATCTG CTCCGCCAGC 
121 AAATGGGCCG ACCGTCTTCT CTCCGACTTC AATTTCACCT CCGATTCCTC CTCCTCCTCC 
181 TTCGCCACCG CCACCACCAC CGCCACTCTC GTCTCTCTGC CACCATCTAT TGATCGTCCc" 
241 GAACGCCACG TCCCCATCCC CATTGATTTC TACCAGGTAT TAGGAGCTCA AACACATTTC 
301 TTAACCGATG GAATCAGAAG AGCATTCGAA GCTAGGGTTT CGAAACCGCC GCAATTCGGT 
361 TTCAGCGACG ACGCTTTAAT CAGCCGGAGA CAGATTCTTC AAGCTGCTTG CGAAACTCTG 
421 TCTAATCCTC GGTCTAGAAG AGAGTACAAT GAAGGTCTTC TTGATGATGA AGAAGCTACA 
481 GTCATCACTG ATGTTCCTTG GGATAAGGTT CCTGGGGCTC TCTGTGTATT GCAAGAAGGT 
541 GGTGAGACTG AGATAGTTCT TCGGGTTGGT GAGGCTCTGC TTAAGGAGAG GTTGCCTAAG 
601 TCGTTTAAGC AAGATGTGGT TTTAGTTATG GCGCTTGCGT TTCTCGATGT CTCGAGGGAT 
661 GCTATGGCAT TGGATCCACC TGATTTTATT ACTGGTTATG AGTTTGTTGA GGAAGCTTTG 
721 AAGCTTTTAC AGGAGGAAGG AGCAAGTAGC CTTGCACCGG ATTTACGTGC ACAAATTGAT 
781 GAGACTTTGG AAGAGATCAC TCCGCGTTAT GTCTTGGAGC TACTTGGCTT ACCGCTTGGT 
841 GATGATTACG CTGCGAAAAG ACTAAATGGT TTAAGCGGTG TGCGGAATAT TTTGTGGTCT 
901 GTTGGAGGAG GTGGAGCATC AGCTCTTGTT GGGGGTTTGA CCCGTGAGAA GTTTATGAAT 
961 GAGGCGTTTT TACGAATGAC AGCTGCTGAG CAGGTTGATC TTTTTGTAGC TACCCCAAGC 
1021 AATATTCCAG CAGAGTCATT TGAAGTTTAC GAAGTTGCAC TTGCTCTTGT GGCTCAAGCT 
1081 TTTATTGGTA AGAAGCCACA CCTTTTACAG GATGCTGATA AGCAATTCCA GCAACTTCAG 
1141 CAGGCTAAGG TAATGGCTAT GGAGATTCCT GCGATGTTGT ATGATACACG GAATAATTGG 
1201 GAGATAGACT TCGGTCTAGA AAGGGGACTC TGTGCACTGC TTATAGGCAA AGTTGATGAA 
12 61 TGCCGTATGT GGTTGGGCTT AGACAGTGAG GATTCACAAT ATAGGAATCC AGCTATTGTG 
1321 GAGTTTGTTT TGGAGAATTC AAATCGTGAT GACAATGATG ATCTCCCTGG ACTATGCAAA 
1381 TTGTTGGAAA CCTGGTTGGC AGGGGTTGTC TTTCCTAGGT TCAGAGACAC CAAAGATAAA 
1441 AAATTTAAAC TCGGGGACTA CTATGATGAT CCTATGGTTT TGAGTTACTT GGAAAGAGTG 
1501 GAGGTAGTTC AGGGTTCTCC TTTAGCTGCT GCTGCAACTA TGGCAAGGAT TGGAGCCGAG 
1561 CATGTGAAAG CTAGTGCTAT GCAGGCACTG CAGAAAGTTT TTCCTTCCCG CTATACAGAT 
1621 AGAAACTCGG CTGAACCCAA GGATGTGCAA GAGACAGTGT TTAGTGTAGA TCCTGTTGGT 
1681 AACAATGTAG GCCGTGATGG TGAGCCTGGT GTCTTTATTG CAGAAQCTGT AAGACCCTCT 
1741 GAAAACTTTG AAACTAATGA TTATGCAATT CGAGCTGGGG TCTCAGAGAG TAGCGTTGAT 
1801 GAAACTACTG TTGAAATGTC CGTTGCTGAT ATGTTAAAGG AGGCAAGTGT GAAGATCCTA 
1861 GCTGCTGGTG TGGCAATTGG ACTGATTTCA CTGTTCAGCC AGAAGTATTT TCTTAAAAGC 
1921 AGCTCATCTT TTCAACGCAA GGATATGGTT TCTTCTATGG AATCTGATGT CGCTACCATA 
1981 GGGTCAGTCA GAGCTGACGA TTCAGAAGCA CTTCCCAGAA TGGATGCTAG GACTGCAGAG 
2 041 AATATAGTAT CCAAGTGGCA GAAGATTAAG TCTCTGGCTT TTGGGCCTGA TCACCGCATA 
2101 GAAATGTTAC CAGAGGTTTT GGATGGGCGA ATGCTGAAGA TTTGGACTGA CAGAGCAGCT 
2161 GAAACTGCGC AGCTTGGGTT GGTTTATGAT TATACACTGT TGAAACTATC TGTTGACAGT 
2221 GTGACAGTCT CAGCAGATGG AACCCGTGCT CTGGTGGAAG CAACTCTGGA GGAGTCTGCT 
2281 TGTCTATCTG ATTTGGTTCA TCCAGAAAAC AATGCTACTG ATGTCAGAAC CTACACAACA 
2341 AGATACGAAG TTTTCTGGTC CAAGTCAGGG TGGAAAATCA CTGAAGGCTC TGTTCTTGCA 
2401 TCATAA 
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B. Gemomic sequence of AtFtBi2 gene (§EQ4®^^Oa SEP ID NO;3) 
synoiiym: At5g42480; syiaoEym: ARC6) 

Sequence length = 3667 nt 

This sequence contains 480 nt of the 5' and 149 nt of the 3' region 

Start codon (ATG) is at position 481-483 
Stop codon (TAA) is at position 3516-3518 

1 TGTTCTGCAT TAAGGAGAAT ACAATTATAA GCAATTTGTC TTGATTTCAA CAAGATTTTG 
61 CTTGGCTATA GGATTCATTG GCTCTGTTTG CTTTTACATT TACATGTCAT AATAGTTTCG 
121 ^AATTTTACAC ATTTCAGTTG GATGTTAAGA AAAGAGAGGG AATTGATGGG GTTTTGTGGG 
181 TTTAAACTTT AAAGTAGTCA AGAATTAAGT CATTGGTTTA CTGTTGCTCT ATATGTGTAA 
241 AATGAAGGCA ACTCCAACGG TTCTTAGGTG GAATAGATTA TTTAGACGAT TTAACATCAT 
3 01 AAAGTCCGTG GCGACTGTAA CATCATAGAT TGTTTTTTAT TTTTTTCAGT AGCTGGTGAT 
3 61 GTTTTTTGAT TTAACTTATA CTACTCAAAA TCAAAATTCC ATAAACCCTA GACGACCAAA 
421 CAGTCTCTTC AATATGTAAA ACAGAACAAA GTTTTTGTAG TAGCCTAAAA AGACACTCCC 
481 ATGGAAGCTC TGAGTCACGT CGGCATTGGT CTCTCCCCAT TCCAATTATG CCGATTACCA 
541 CCGGCGACGA CAAAGCTCCG ACGTAGCCAC AACACCTCTA CAACTATCTG CTCCGCCAGC 
601 AAATGGGCCG ACCGTCTTCT CTCCGACTTC AATTTCACCT CCGATTCCTC CTCCTCCTCC 
661 TTCGCCACCG CCACCACCAC CGCCACTCTC GTCTCTCTGC CACCATCTAT TGATCGTCCC 
721 GAACGCCACG TCCCCATCCC CATTGATTTC TACCAGGTAT TAGGAGCTCA AACACATTTC 
7 81 TTAACCGATG GAATCAGAAG AGCATTCGAA GCTAGGGTTT CGAAACCGCC GCAATTCGGT 
841 TTCAGCGACG ACGCTTTAAT CAGCCGGAGA CAGATTCTTC AAGCTGCTTG CGAAACTCTG 
901 TCTAATCCTC GGTCTAGAAG AGAGTACAAT GAAGGTCTTC TTGATGATGA AGAAGCTACA 
961 GTCATCACTG ATGTTCCTTG GGATAAGGTA ATTTCGATTT CGGAATAATA AAGTTTCTTC 
1021 GTTTTAATTT CATGAATTGG ATAAAGGAAG GAACTTTTAT CTAGTGAAGG TTCCTGGGGC 
1081 TCTCTGTGTA TTGCAAGAAG GTGGTGAGAC TGAGATAGTT CTTCGGGTTG GTGAGGCTCT 
1141 GCTTAAGGAG AGGTTGCCTA AGTCGTTTAA GCAAGATGTG GTTTTAGTTA TGGCGCTTGC 
1201 GTTTCTCGAT GTCTCGAGGG ATGCTATGGC ATTGGATCCA CCTGATTTTA TTACTGGTTA 
1261 TGAGTTTGTT GAGGAAGCTT TGAAGCTTTT ACAGGTAGTT TGACTTGCTT TGGTAATTTG 
1321 ACGAGCGTTG GCTTTATAAG AACTTTCTTG ATTTGATACT TTGTTATTGA GTCTTGTGTA 
1381 GGAGGAAGGA GCAAGTAGCC TTGCACCGGA TTTACGTGCA CAAATTGATG AGACTTTGGA 
1441 AGAGATCACT CCGCGTTATG TCTTGGAGCT ACTTGGCTTA CCGCTTGGTG ATGATTACGC 
1501 TGCGAAAAGA CTAAATGGTT TAAGCGGTGT GCGGAATATT TTGTGGTCTG TTGGAGGAGG 
1561 TGGAGCATCA GCTCTTGTTG GGGGTTTGAC CCGTGAGAAG TTTATGAATG AGGCGTTTTT 
1621 ACGAATGACA GCTGCTGAGC AGGTATACAG TTTAGATACC TTTTTTTAAT TTCTTTAGCA 
1681 TGATATAACT TTAGGTTTCT CATTTTAATG TATGTTGTGT GGTAGGTTGA TCTTTTTGTA 
1741 GCTACCCCAA GCAATATTCC AGCAGAGTCA TTTGAAGTTT ACGAAGTTGC ACTTGCTCTT 
1801 GTGGCTCAAG CTTTTATTGG TAAGAAGCCA CACCTTTTAC AGGATGCTGA TAAGCAATTC 
1861 CAGCAACTTC AGCAGGCTAA GGTAATGGCT ATGGAGATTC CTGCGATGTT GTATGATACA 
1921 CGGAATAATT GGGAGATAGA CTTCGGTCTA GAAAGGGGAC TCTGTGCACT GCTTATAGGC 
1981 AAAGTTGATG AATGCCGTAT GTGGTTGGGC TTAGACAGTG AGGATTCACA ATATAGGAAT 
2041 CCAGCTATTG TGGAGTTTGT TTTGGAGAAT TCAAATCGTG ATGACAATGA TGATCTCCCT 
2101 GGACTATGCA AATTGTTGGA AACCTGGTTG GCAGGGGTTG TCTTTCCTAG GTTCAGAGAC 
2161 ACCAAAGATA AAAAATTTAA ACTCGGGGAC TACTATGATG ATCCTATGGT TTTGAGTTAC 
2221 TTGGAAAGAG TGGAGGTAGT TCAGGGTTCT CCTTTAGCTG CTGCTGCAAC TATGGCAAGG 
2281 ATTGGAGCCG AGCATGTGAA AGCTAGTGCT ATGCAGGCAC TGCAGAAAGT TTTTCCTTCC 
2341 CGCTATACAG ATAGAAACTC GGCTGAACCC AAGGATGTGC AAGAGACAGT GTTTAGTGTA 
2401 GATCCTGTTG GTAACAATGT AGGCCGTGAT GGTGAGCCTG GTGTCTTTAT TGCAGAAGCT 
2461 GTAAGACCCT . CTGAAAACTT TGAAACTAAT GATTATGCAA TTCGAGCTGG GGTCTCAGAG 
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2521 AGTAGCGTTG ATGAAACTAC TGTTGAAATG 
25 81 GTGAAGATCC TAGCTGCTGG TGTGGCAATT 
2641 TTTCTTAAAA GCAGCTCATC TTTTCAACGC 
2701 GTCGCTACCA TAGGTATGAT TAAATGATGC 
2761 TGCTTGTTTT GTGAGCTAAG AACATAGTTC 
2821 AAGATTAACA AGTTGCTGAG TAAATTTCAC 
2881 CTGTAGACAG AAATGTAAAT TTCACTCTCA 

2 941 AGATTGCCTT AGTGTGGCTT TGTCCT^CTT 
3001 AGGGTCAGTC AGAGCTGACG ATTCAGAAGC 

3 061 GAATATAGTA TCCAAGTGGC AGAAGATTAA 
3121 AGAAATGTTA CCAGAGGTGA. GGGAATAAAT 
3181 TGGACATGAT TATAGTCTGG TGCCTTGTTT 
3241 GAATGCTGAA GATTTGGACT GACAGAGCAG 

33 01 ATTATACACT GTTGAAACTA TCTGTTGACA 
3361 CTCTGGTGGA AGCAACTCTG GAGGAGTCTG 
3421 ACAATGCTAC TGATGTCAGA ACCTACACAA 

34 81 GGTGGAAAAT CACTGAAGGC TCTGTTCTTG 
3 541 GAGCTTGCGA GATTCTCTTT GTTCTGTAAA 
3601 ACACAAAAAA ATTAACGTTC TTGGCACACC 
3 661 GCTACAA 



TCCGTTGCTG ATATGTTAAA GGAGGCAAGT 
GGACTGATTT CACTGTTCAG CCAGAAGTAT 
AAGGATATGG TTTCTTCTAT GGAATCTGAT 
AATTTTCATA TATCTGCATT GCTCAAAATA 
CCACTTAATA CATGTCCCAA AAGTTGTACC 
TAATTATGCT GCTTGAATTT TTTGATCAAA 
ACATTTCTGT TTAGAATAAC GTAGGATTAG 
TTCTTTCCTT GATTTTTTTC TTTTCGATTT 
ACTTCCCAGA ATGGATGCTA GGACTGCAGA 
GTCTCTGGCT TTTGGGCCTG ATCACCGCAT 
CTACAATTCA ATCAATTGTG TGAAAACTGT 
GATTCTGTTA TTTATAGGTT TTGGATGGGC 
CTGAAACTGC GCAGCTTGGG TTGGTTTATG 
GTGTGACAGT CTCAGCAGAT GGAACCCGTG 
CTTGTCTATC TGATTTGGTT CATCCAGAAA 
CAAGATACGA AGTTTTCTGG TCCAAGTCAG 
CATCATAATA TACTCATATG TAGCATGTCT 
TTCTCTCTCT AAGTTAGTGT TTATAAATGA 
CTTTTCCTTG ATCTAAACTA TAACATAAGG 
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C. predicted cDNA sequence of mutated AtFtul gene (SEQ ID NO:9) 
synoniym: At5g42480; synonym: ARC6) 

Sequence length = 24 06 nt 

Start codon (ATG) is at position 1-3 

Premature stop codon (TGA) is at position 973-975 

Stop codon (TAA) is at position 2404-2406 

1 ATGGAAGCTC TGAGTCACGT CGGCATTGGT CTCTCCCCAT TCCAATTATG CCGATTACCA 
61 CCGGCGACGA CAAAGCTCCG ACGTAGCCAC AACACCTCTA CAACTATCTG CTCCGCCAGC 
121 AAATGGGCCG ACCGTCTTCT CTCCGACTTC MTT TCACCt CCGATTCCTC CTCCTCCTCC 
181 TTCGCCACCG CCACCACCAC CGCCACTCTC GTCTCTCTGC CACCATCTAT TGATCGTCCC 
241 GAACGCCACG TCCCCATCCC CATTGATTTC TACCAGGTAT TAGGAGCTCA AACACATTTC 

3 01 TTAACCGATG GAATCAGAAG AGCATTCGAA GCTAGGGTTT CGAAACCGCC GCAATTCGGT 
361 TTCAGCGACG ACGCTTTAAT CAGCCGGAGA CAGATTCTTC AAGCTGCTTG CGAAACTCTG 
421 TCTAATCCTC GGTCTAGAAG AGAGTACAAT GAAGGTCTTC TTGATGATGA AGAAGCTACA 

4 81 GTCATCACTG ATGTTCCTTG GGATAAGGTT CCTGGGGCTC TCTGTGTATT GCAAGAAGGT 
541 GGTGAGACTG AGATAGTTCT TCGGGTTGGT GAGGCTCTGC TTAAGGAGAG GTTGCCTAAG 
601 TCGTTTAAGC AAGATGTGGT TTTAGTTATG GCGCTTGCGT TTCTCGATGT CTCGAGGGAT 
661 GCTATGGCAT TGGATCCACC TGATTTTATT ACTGGTTATG AGTTTGTTGA GGAAGCTTTG 
721 AAGCTTTTAC AGGAGGAAGG AGCAAGTAGC CTTGCACCGG ATTTACGTGC ACAAATTGAT 
781 GAGACTTTGG AAGAGATCAC TCCGCGTTAT GTCTTGGAGC TACTTGGCTT ACCGCTTGGT 
841 GATGATTACG CTGCGAAAAG ACTAAATGGT TTAAGCGGTG TGCGGAATAT TTTGTGGTCT 
901 GTTGGAGGAG GTGGAGCATC AGCTCTTGTT GGGGGTTTGA CCCGTGAGAA GTTTATGAAT 
961 GAGGCGTTTT TATGAATGAC AGCTGCTGAG CAGGTTGATC TTTTTGTAGC TACCCCAAGC 
1021 AATATTCCAG CAGAGTCATT TGAAGTTTAC GAAGTTGCAC TTGCTCTTGT GGCTCAAGCT 
1081 TTTATTGGTA AGAAGCCACA CCTTTTACAG GATGCTGATA AGCAATTCCA GCAACTTCAG 
1141 CAGGCTAAGG TAATGGCTAT GGAGATTCCT GCGATGTTGT ATGATACACG GAATAATTGG 
12 01 GAGATAGACT TCGGTCTAGA AAGGGGACTC TGTGCACTGC TTATAGGCAA AGTTGATGAA 
12 61 TGCCGTATGT GGTTGGGCTT AGACAGTGAG GATTCACAAT ATAGGAATCC AGCTATTGTG 
1321 GAGTTTGTTT TGGAGAATTC AAATCGTGAT GACAATGATG ATCTCCCTGG ACTATGCAAA 
1381 TTGTTGGAAA CCTGGTTGGC AGGGGTTGTC TTTCCTAGGT TCAGAGACAC CAAAGATAAA 
1441 AAATTTAAAC TCGGGGACTA CTATGATGAT CCTATGGTTT TGAGTTACTT GGAAAGAGTG 
1501 GAGGTAGTTC AGGGTTCTCC TTTAGCTGCT GCTGCAGCTA TGGCAAGGAT TGGAGCCGAG 
1561 CATGTGAAAG CTAGTGCTAT GCAGGCACTG CAGAAAGTTT TTCCTTCCCG CTATACAGAT 
1621 AGAAACTCGG CTGAACCCAA GGATGTGCAA GAGACAGTGT TTAGTGTAGA TCCTGTTGGT 
1681 AACAATGTAG GCCGTGATGG TGAGCCTGGT GTCTTTATTG CAGAAGCTGT AAGACCCTCT 
1741 GAAAACTTTG AAACTAATGA TTATGCAATT CGAGCTGGGG TCTCAGAGAG TAGCGTTGAT 
1801 GAAACTACTG TTGAAATGTC CGTTGCTGAT ATGTTAAAGG AGGCAAGTGT GAAGATCCTA 
1861 GCTGCTGGTG TGGCAATTGG ACTGATTTCA CTGTTCAGCC AGAAGTATTT TCTTAAAAGC 
1921 AGCTCATCTT TTCAACGCAA GGATATGGTT TCTTCTATGG AATCTGATGT CGCTACCATA 
1981 GGGTCAGTCA GAGCTGACGA TTCAGAAGCA CTTCCCAGAA TGGATGCTAG GACTGCAGAG 

2 041 AATATAGTAT CCAAGTGGCA GAAGATTAAG TCTCTGGCTT TTGGGCCTGA TCACCGCATA 
2101 GAAATGTTAC CAGAGGTTTT GGATGGGCGA ATGCTGAAGA TTTGGACTGA CAGAGCAGCT 
2161 GAAACTGCGC AGCTTGGGTT GGTTTATGAT TATACACTGT TGAAACTATC TGTTGACAGT 
2221 GTGACAGTCT CAGCAGATGG AACCCGTGCT CTGGTGGAAG CAACTCTGGA GGAGTCTGCT 
22 81 TGTCTATCTG.ATTTGGTTCA TCCAGAAAAC AATGCTACTG ATGTCAGAAC CTACACAACA 
2341 AGATACGAAG TTTTCTGGTC CAAGTCAGGG TGGAAAATCA CTGT^GGCTC TGTTCTTGCA 
24 01 TCATAA 
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D. Genomic sequence of muitated AtFtn2 gene (SEQ ID NO:10) 
(synonym: At5g42480; synonym: ARC6) 

Sequence length = 3 667 nt 

This sequence contains 480 nt of the 5' and 149 nt of the 3' region 

Start codon (ATG) is at position 481-483 

Premature stop codon (TGA) is at position 1622-1624 

Stop codon (TAA) is at position 3516-3518 

1 TGTTCTGCAT TAAGGAGAAT ACAATTATAA GCAATTTGTC TTGATTTCAA CAAGATTTTG 
61 CTTGGCTATA GGATTCATTG GCTCTGTTTG CTTTTACATT TACATGTCAT AATAGTTTCG 
121 AATTTTACAC ATTTCAGTTG GATGTTAAGA AAAGAGAGGG AATTGATGGG GTTTTGTGGG 
181 TTTAAACTTT AAAGTAGTCA AGAATTAAGT CATTGGTTTA CTGTTGCTCT ATATGTGTAA 
241 AATGAAGGCA ACTCCAACGG TTCTTAGGTG GAATAGATTA TTTAGACGAT TTAACATCAT 
3 01 AAAGTCCGTG GCGACTGTTU^ CATCATAGAT TGTTTTTTAT TTTTTTCAGT AGCTGGTGAT 
361 GTTTTTTGAT TTAACTTATA CTACTCAAAA TCAAAATTCC ATAAACCCTA GACGACCAAA 
421 CAGTCTCTTC AATATGTAAA ACAGAACAAA GTTTTTGTAG TAGCCTAAAA AGACACTCCC 
481 ATGGAAGCTC TGAGTCACGT CGGCATTGGT CTCTCCCCAT TCCAATTATG CCGATTACCA 
541 CCGGCGACGA CAAAGCTCCG ACGTAGCCAC AACACCTCTA CAACTATCTG CTCCGCCAGC 
601 AAATGGGCCG ACCGTCTTCT CTCCGACTTC AATTTCACCT CCGATTCCTC CTCCTCCTCC 
661 TTCGCCACCG CCACCACCAC CGCCACTCTC GTCTCTCTGC CACCATCTAT TGATCGTCCC 
721 GAACGCCACG TCCCCATCCC CATTGATTTC TACCAGGTAT TAGGAGCTCA AACACATTTC 
781 TTAACCGATG GAATCAGAAG AGCATTCGAA GCTAGGGTTT CGAAACCGCC GCAATTCGGT 
841 TTCAGCGACG ACGCTTTAAT CAGCCGGAGA CAGATTCTTC AAGCTGCTTG CGAAACTCTG 
901 TCTAATCCTC GGTCTAGAAG AGAGTACAAT GAAGGTCTTC TTGATGATGA AGAAGCTACA 
961 GTCATCACTG ATGTTCCTTG GGATAAGGTA ATTTCGATTT CGGAATAATA AAGTTTCTTC 
1021 GTTTTAATTT CATGAATTGG ATAAAGGAAG GAACTTTTAT CTAGTGAAGG TTCCTGGGGC 
10 81 TCTCTGTGTA TTGCAAGAAG GTGGTGAGAC TGAGATAGTT CTTCGGGTTG GTGAGGCTCT 
1141 GCTTAAGGAG AGGTTGCCTA AGTCGTTTAA GCAAGATGTG GTTTTAGTTA TGGCGCTTGC 
12 01 GTTTCTCGAT GTCTCGAGGG ATGCTATGGC ATTGGATCCA CCTGATTTTA TTACTGGTTA 

12 61 TGAGTTTGTT GAGGAAGCTT TGAAGCTTTT ACAGGTAGTT TGACTTGCTT TGGTAATTTG 
1321 ACGAGCGTTG GCTTTATAAG AACTTTCTTG ATTTGATACT TTGTTATTGA GTCTTGTGTA 

13 81 GGAGGAAGGA GCAAGTAGCC TTGCACCGGA TTTACGTGCA CAAATTGATG AGACTTTGGA 
1441 AGAGATCACT CCGCGTTATG TCTTGGAGCT ACTTGGCTTA CCGCTTGGTG ATGATTACGC 
1501 TGCGAAAAGA CTAAATGGTT TAAGCGGTGT GCGGAATATT TTGTGGTCTG TTGGAGGAGG 
1561 TGGAGCATCA GCTCTTGTTG GGGGTTTGAC CCGTGAGAAG TTTATGAATG AGGCGTTTTT 
1621 ATGAATGACA GCTGCTGAGC AGGTATACAG TTTAGATACC TTTTTTTAAT TTCTTTAGCA 
1681 TGATATAACT TTAGGTTTCT CATTTTAATG TATGTTGTGT GGTAGGTTGA TCTTTTTGTA 
1741 GCTACCCCAA GCAATATTCC AGCAGAGTCA TTTGAAGTTT ACGAAGTTGC ACTTGCTCTT 
1801 GTGGCTCAAG CTTTTATTGG TAAGAAGCCA CACCTTTTAC AGGATGCTGA TAAGCAATTC 
1861 CAGCAACTTC AGCAGGCTAA GGTAATGGCT ATGGAGATTC CTGCGATGTT GTATGATACA 
1921 CGGAATAATT GGGAGATAGA CTTCGGTCTA GAAAGGGGAC TCTGTGCACT GCTTATAGGC 
1981 AAAGTTGATG AATGCCGTAT GTGGTTGGGC TTAGACAGTG AGGATTCACA ATATAGGAAT 
2041 CCAGCTATTG TGGAGTTTGT TTTGGAGAAT TCAAATCGTG ATGACAATGA TGATCTCCCT 
2101 GGACTATGCA AATTGTTGGA AACCTGGTTG GCAGGGGTTG TCTTTCCTAG GTTCAGAGAC 
2161 ACCAAAGATA AAAAATTTAA ACTCGGGGAC TACTATGATG ATCCTATGGT TTTGAGTTAC 
2221 TTGGAAAGAG TGGAGGTAGT TCAGGGTTCT CCTTTAGCTG CTGCTGCAGC TATGGCAAGG 
2281 ATTGGAGCCG AGCATGTGAA AGCTAGTGCT ATGCAGGCAC TGCAGAAAGT TTTTCCTTCC 
2341 CGCTATACAG ATAGAAACTC GGCTGAACCC AAGGATGTGC AAGAGACAGT GTTTAGTGTA 
24 01 GATCCTGTTG GTAACAATGT AGGCCGTGAT GGTGAGCCTG GTGTCTTTAT TGCAGAAGCT 
2461 GTAAGACCCT CTGAAAACTT TGAAACTAAT GATTATGCAA TTCGAGCTGG GGTCTCAGAG 
2521 AGTAGCGTTG ATGAAACTAC TGTTGAAATG TCCGTTGCTG ATATGTTAAA GGAGGCAAGT 
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2581 GTGAAGATCC TAGCTGCTGG TGTGGCAATT 
2641 TTTCTTAAAA GCAGCTCATC TTTTCAACGC 
2701 GTCGCTACCA TAGGTATGAT TAAATGATGC 
2761 TGCTTGTTTT GTGAGCTAAG AACATAGTTC 
2821 AAGATTAACA AGTTGCTGAG TAAATTTCAC 
2881 CTGTAGACAG AAATGTAAAT TTCACTCTCA 
2941 AGATTGCCTT AGTGTGGCTT TGTCCAACTT 
3 001 AGGGTCAGTC AGAGCTGACG ATTCAGAAGC 
3061 GAATATAGTA TCCAAGTGGC AGAAGATTAA 
3121 AGAAATGTTA CCAGAGGTGA GGGAATAAAT 
3181 TGGACATGAT TATAGTGTGG TGCCTTGTTT 
3241 GAATGCTGAA GATTTGGACT GACAGAGCAG 
33 01 ATTATACACT GTTGAAACTA TCTGTTGACA 
3361 CTCTGGTGGA AGCAACTCTG GAGGAGTCTG 
3421 ACAATGCTAC TGATGTCAGA ACCTACACAA 
3481 GGTGGAAAAT CACTGAAGGC TCTGTTCTTG 
3541 GAGCTTGCGA GATTCTCTTT GTTCTGTAAA 
3601 ACACAAAAAA ATTAACGTTC TTGGCACACC 
3661 GCTACAA 



GGACTGATTT CACTGTTCAG CCAGAAGTAT 
AAGGATATGG TTTCTTCTAT GGAATCTGAT 
AATTTTCATA TATCTGCATT GCTCAAAATA 
CCACTTAATA CATGTCCCAA AAGTTGTACC 
TAATTATGCT GCTTGAATTT TTTGATCAAA 
ACATTTCTGT TTAGAATAAC GTAGGATTAG 
TTCTTTCCTT GATTTTTTTC TTTTCGATTT 
ACTTCCCAGA ATGGATGCTA GGACTGCAGA 
GTCTCTGGCT TTTGGGCCTG ATCACCGCAT 
CTACAATTCA ATCAATTGTG TGAAAACTGT 
GATTCTGTTA TTTATAGGTT TTGGATGGGC 
CTGAAACTGC GCAGCTTGGG TTGGTTTATG 
GTGTGACAGT CTCAGCAGAT GGAACCCGTG 
CTTGTCTATC TGATTTGGTT CATCCAGAAA 
CAAGATACGA AGTTTTCTGG TCCAAGTCAG 
CATCATAATA TACTCATATG TAGCATGTCT 
TTCTCTCTCT AAGTTAGTGT TTATAAATGA 
CTTTTCCTTG ATCTAAACTA TAACATAAGG 
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Amino Acid Sequences 

A. predicted amino acid sequence of AtFtn2 

(synonym: At5g42480; synonym: ARC6) protein 

Sequence length = 801 aa 

1 MEALSHVGIG LSPFQLCRLP PATTKLRRSH NTSTTICSAS KWADRLLSDF NFTSDSSSSS 
- 61 FATATTTATL VSLPPSIDRP-^ ERHVPIPIDF YQVLGAQTHF LTDGIRRAFE ARVSKPPQFG 
121 FSDDALISRR QILQAACETL SNPRSRREYN EGLLDDEEAT VITDVPWiDKV PGALCVLQEG 
181 GETEIVLRVG EALLKERLPK SFKQDWLVM ALAFLDVSRD AMALDPPDFI TGYEFVEEAL 
241 KLLQEEGASS LAPDLRAQID ETLEEITPRY VLELLGLPLG DDYAAKRLNG LSGVRNILWS 
301 VGGGGASALV GGLTREKFMN EAFLRMTAAE QVDLFVATPS NIPAESFEVY EVALALVAQA 
361 FIGKKPHLLQ DADKQFQQLQ QAKVMAMEIP AMLYDTRNNW EIDFGLERGL CALLIGKVDE 
421 CRMWLGLDSE DSQYRNPAIV EFVLENSNRD DNDDLPGLCK LLETWLAGW FPRFRDTKDK 
481 KFKLGDYYDD PMVLSYLERV EWQGSPLAA AATMARIGAE HVKASAMQAL QKVFPSRYTD 
541 RNSAEPKDVQ ETVFSVDPVG NNVGRDGEPG VFIAEAVRPS ENFETNDYAI RAGVSESSVD 
601 ETTVEMSVAD MLKEASVKIL AAGVAIGLIS LFSQKYFLKS SSSFQRKDMV SSMESDVATI 
661 GSVRADDSEA LPRMDARTAE NIVSKWQKIK SLAFGPDHRI EMLPEVLDGR MLKIWTDRAA 
721 ETAQLGLVYD YTLLKLSVDS VTVSADGTRA LVEATLEESA CLSDLVHPEN NATDVRTYTT 
781 RYEVFWSKSG WKITEGSVLA S* 



B. predicted amino acid sequence of mutated AtFtn2 
(synonym: At5g42480; synonym: ARCS) protein 

Sequence length = 324 aa 

The mutated protein is truncated as a result of arc6 mutation 
(premature stop ) 

1 MEALSHVGIG LSPFQLCRLP PATTKLRRSH NTSTTICSAS KWADRLLSDF NFTSDSSSSS 
61 FATATTTATL VSLPPSIDRP ERHVPIPIDF YQVLGAQTHF LTDGIRRAFE ARVSKPPQFG 
121 FSDDALISRR QILQAACETL SNPRSRREYN EGLLDDEEAT VITDVPWDKV PGALCVLQEG 
181 GETEIVLRVG EALLKERLPK SFKQDWLVM ALAFLDVSRD AMALDPPDFI TGYEFVEEAL 
241 KLLQEEGASS LAPDLRAQID ETLEEITPRY VLELLGLPLG DDYAAKRLNG LSGVRNILWS 
301 VGGGGASALV GGLTREKFMN EAFL* 
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Synechococcus sp, PCC 7942 cell division protein Ftn2 gene 
A. Ftn2 DNA nucleic acid sequence (SEQ ID NO:4) 



1 cttgccgact aaaggctaag catcgccatt ccttagatta aagcagtctg tcggcggcgc 
61 tgtgccggtt aacaccagtc tgtcgctgac agcggtgcct ttctggggct tgcctgtggg 
121 gcgagtaacc gatcgctggg ataagagttg gtgcttctgg ctctcaagaa tagggttttc 
181 cgtcgcgtat tcccgatcac atccccctgt gtctgctacg gagataacgc cgatcactca 
241 acagaattgg taagttgacg gtcaagttgg gatgatgaag tcggctcaag ctggcgatcc 
301 ggatctggtg ggtgttctgt gcgtattcct ctcgattact accgaattct ctgtgttggc 
36 1- gtgcaagcct cggcagacaa acttgccgaa agctaccgcg atcgcctcaa ccaatcgccc - 
421 tcccatgagt tttcagagct ggcattgcag gcgcggcggc aactcctcga agcagcgatt 
481 gctgagctga gtgatcccga acagcgcgat cgctacgatc gccgcttttt tcagggcggt 
541 ctggaagcga ttgaaccaag cctagaactc gaagactggc agcgaattgg agccctgctg 
601 atcctgctgg aattggggga atacgatcgc gtttcgcaac tggctgagga actcctgcca 
661 gactacgacg cgagcgcaga agtacgcgat cagttcgcgc ggggtgatat cgccttggcg 
721 atcgcactat cccagcaatc cctcggtcga gaatgccgtc agcagggtct gtacgaacag 
781 gccgcccagc actttggccg cagccagtct gccctagccg atcatcagcg ctttcctgaa 
841 ctgagtcgaa ccctgcacca agaacaagga cagctacggc cctatcgcat tttggagcgg 
901 ttggcccagc ccttgactgc cgatagcgat cgccagcagg gtttgctgtt gttgcaggcg 
961 atgttggacg accggcaggg cattgaaggc cctggggatg atggctcggg gctgaccctt 
1021 gataactttt tgatgtttct ccagcaaatt cgcggctatc tgaccctggc tgaacagcag 
1081 ttgctgtttg aatcggaagc gcgtcggccc tcgccggctg cgagcttttt tgcctgctac 
1141 accctgattg cgcggggctt ttgcgatcac caaccctcgt tgatccatcg cgccagcttg 
1201 ctcttgcatg aactcaagag ccgcatggat gtgcacatcg aacaggcgat cgccagccta 
1261 ttgctcggac agcccgaaga agctgaggcg ctactcgtcc agagccaaga tgaggaaacc 
1321 ctcagccaaa tccgtgccct agcccaaggg gaagccctga tcgtcggttt gtgccgattc 
1381 acggaaacct ggctagcgac caaggtattt ccggatttcc gcgacctcaa ggaaaggact 
1441 gcgccgctgc agccctactt tgacgacccc gatgtccaga cctatctgga tgcgatcgtg 
1501 gagttgccgt ccgatttgat gccaacgccg ctacccgttg agccgcttga ggtgcgatcg 
1561 tcgttgctgg ccaaggaact gccgacccca gcaacgcctg gtgtagctcc accccctcgc 
1621 cgccgtcgcc gcgatcgctc cgaacgtcct gctcgcacgg ccaaacgctt gcccttgccc 
1681 tggattggtt tgggggttgt ggtggttctc ggcggtggaa caggggtttg ggcttggcga 
1741 tcgcgttcca attccacccc gccgaccccg ccccccgtgg ttcaaacgct gcctgaggcg 



FIG. 6 continued (2/2) 



1801 gtacctgccc cttcgcccgc gccagttacc gttgccctcg atcgggctca ggctgaaact 
1861 gtgttgcaaa actggttggc cgctaaagct gcagccttgg ggcctcaata cgatcgcgat 
1921 cgcttagcga cggtgctgac cggtgaggtt ctgcagactt ggcagggttt ttctagccag 
1981 caggccaaca cccagctcac atcacagttc gatcacaagt taaccgtcga ctcagttcag 
2041 ctcagtgacg gtgatcaacg agcagtagtc caagccaagg tcgatgaagt tgagcaggtc 
2101 tatcgaggcg accagctgct cgaaacgcgc cgagatttgg gcttggtgat ccgctaccag 
2161 ctcgtgcgcg agaacaacat ctggaaaatt gcttcgatta gtttggtgcg ctaggaattc 
2221 gcaaggggtg aaccccctgc ggtcttttct gtagatcccc tagagcgatc gcagaatgtt 
2281 cagcgattcc tggatgtgcg cttgggcatt caagagtgaa tcaaaaatgt ggcgcacctt 
2341 gccctctttg tcgatcacat aagtgacgcg acccggaatc acaaacaggg ttttgggcac 
2401 gccataggtt tgacggaggc gatcgcctgc atcgctcagc agttggaagg gcaagttgta 
2461 tttctgggc - - 



B. Ftn2 Protein amino acid sequence (SEQ ID NO:5) 

translation="MRIPLDYYRILCVGVQASADKLAESYRDRLNQSPSHEFSELALQ 

ARRQLLEAAIAELSDPEQRDRYDRRFFQGGLEAffiPSLELEDWQRIGALLILLELGEY 

DRVSQLAEELLPDYDASAEVRDQFARGDIALAIALSQQSLGRECRQQGLYEQAAQHFG 

RSQSALADHQRFPELSRTLHQEQGQLRPYRILERLAQPLTADSDRQQGLLLLQAMLDD 

RQGIEGPGDDGSGLTLDNFLMFLQQIRGYLTLAEQQLLFESEARRPSPAASFFACYTL 

lARGFCDHQPSLIHRASLLLHELKSRMDVHIEQAIASLLLGQPEEAEALLVQSQDEET 

LSQIRALAQGEALIYGLCRFTETWLATKVFPDFRDLKERTAPLQPYFDDPDVQTYLDA 

IVELPSDLMPTPLPVEPLEVRSSLLAKELPTPATPGVAPPPRRRRRDRSERPARTAKR 

LPLPWIGLGVVVVLGGGTGVWAWRSRSNSTPPTPPPVVQTLPEAVPAPSPAPVTVALD 

RAQAETVLQNWLAAKAAALGPQYDRDRLATVLTGEVLQTWQGFSSQQANTQLTSQFD 

HKLTVDSVQLSDGDQRAVVQAKVDEVEQVYRGDQLLETRRDLGLVIRYQLVRENNIW 

KIASISLVR" 
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Synechococcus sp. PCC 7942 cell division protein Ftn6 gene 

A. Ftn6 DNA nucleic acid sequence (SEQ ID NO:6) 

1 ctcgatactt gggagttgaa cacagagtag tagtctaagt aacaactgct cgtgagcaat 
61 ttgctacact ttttaccaaa ttttgagctc agttttcgcg aaaactggga tgttgagttg 
121 aaccctcagc agcaaaattg taccgcctga gacttttacc gttttattcg gccatctggg 
181 aacaatcgcc ctggagctta ttgtgacctc tacccgtact gccgttattg ccttgttaga 
241 acgctatttc gagctgtcgg cagcgcgagc agcagaggtc ttgcagcaac tgcgatcgca 
301 ccaccctgaa gcctggattt atcccgccap agtcgaggcg atttaccaag gccgttaccg 
361 ctgggtgtcg atcgcacaaa tccttgctct gtggcagcgg cgcgggcaga tcaactgcca 
421 cttcagtgca gactatgagc gcttgttgct cggtgaagtt ccagagcaac ccgatcgcat 
48 1 caatgttgag acgcggctcc ctgcgatcgc catgaccttg ccttgggtgc cagaacagcc 
541 tggagaagca ttcgtgccag cgcaagatca gtcgggttta actgagcgcc tttataaaac 
601 gttggtcaaa gcgggcagcg attgcgctgg gtaggcttag aacagttgcc atccaaactt 
661 gagagtgccc gttcggccag ccaagagaat tccaagagcc tttcagaacg gacaacaatt 
721 ctgctctaca atcaagcccg agtgaagagg cggcgggcta ttggctgaat ggcaaaaaac 
781 atcattcttt cagcaatcgt gggttatacc tacgacaaaa ttgacctatt cttaacttct 
841 gcactccgta acacctcagc agatattctt ttaattgcat caagtccttc agcccaactc 
901 cgtcatcagt tattgagttc acctcgggtc aaactcgttg atgtgaacct tcaaggtgaa 
961 ccagctgaaa tggtatttcg ccgtttcttt attgccaagg agattttggc gagaatcgaa 
1021 gcagatgaaa ttctcttgag cgatgctcgc gatgtctatt tccaatctga cccttttggt 
1081 gtccaagggg ttttatttgc cgaggaacct cagctaatcg caaactgtaa agtcaatagc 
1141 agctggataa aaaaatactt aggagaggat gagtttcaag ccatttctcc taatccaatt 
1201 ctctgcgggg gcaaccatgt gctggatgcc accaaggcct ttagcctgac gttgaccaca 
1261 ccagaagaaa ttgttgggct gcccgagagt ttgctggcct tggcggctca agctgctcaa 
1321 gccgctggtg aaacagaggc aacacccgaa gccggccctt ggcgaatcac cctcgacttc 
1381 ccaagctttg 



B. Ftn6 Protein amino acid sequence (SEQ ID NO:7) 

MGTIALELIVTSTRTAVIALLERYFELSAARAAEVLQQLRSHHP 

EAWIYPATVEAIYQGRYRWVSIAQILALWQRRGQINCHFSADYERLLLGEVPEQP 

DRINVETRLPAIAMTLPWVPEQPGEAFVPAQDQSGLTERLYKTLVKAGSDCAG 



IS 



"Replacement Sheet" 



First Set 

ACCESSION BK000999 



SEQ ID NO: 125: 
MEGFHNLLARPNSAPFAFSLPRPRPRPRRRPPPHPSAACRAASR 

WAERLFADFHLLPTAAPSDPPSPAPAPAAAPSASPFVPLFPDAAERSLPLQVDFYKVL 

GAEPHFLGDGIRRAFEARIAKPPQYGYSTDALVGRRQMLQIAHDTLMNQNSRTQYDRA 

LSENREEALTMDIAWDKEAGEALAVLWGEQLLLDRPPKRFKQDWLAMALAYVDLSR 

DAMAASPPDVIGCCEVLERALKLLQEDGASNLAPDLLSQIDETLEEITPRCVLELLSL 

PIDTEHHKKRQEGLQGARNILWSVGRGGIATVGGGFSREAFMNEAFLRMTSIEQMDFF 

SKTPNSIPPEWFEIYNVALAHVAQAIISKRPQFIMMADDLFEQLQKFNIGSHYAYDNE 

MDLALERAFCSLLVGDVSKCRMWLGIDNESSPYRDPKILEFIVTNSSISEENDLLPGL 

CKLLETWLIFEVFPRSRDTRGMQFRLGDYYDDPEVLSYLERMEGGGASHLAAAAAIAK 

LGAQATAALGTVKSNAIQAFNKVFPLIEQLDRSAMENTKDGPGGYLENFDQENAPAHD 

SRNAALKIISAGALFALLAVIGAKYLPRKRPLSAIRSEHGSVAVANSVDSTDDPALDE 

DPVHI PRMDAKLAED I VRKWQS I KS KALGPEHS VASLQEVLDGNMLKVWTDRAAE I ER 

HGWFWEYTLSDVTIDSITISLDGRRATVEATIDEAGQLTDVTEPRNNDSYDTKYTTRY 

EMAFSKLGGWKITEGAVLKS " 



SEQ ID NO: 126: 

BASE COUNT 551 a 576 C 592 g 564 t 

ORIGIN 

1 atggagggct tccacaacct cctcgcccgc cccaactcgg cgccattcgc cttctccctc 
61 cctcgcccgc gcccgcgccc gcgccgcagg ccgccgcctc acccctccgc tgcctgccgc 
121 gccgcgagcc gctgggccga acgcctcttc gccgacttcc acctcctccc caccgccgcg 
181 ccctccgacc cgccgtcccc ggccccggcc ccggccgccg cgccctccgc ctcccccttc 
241 gtcccgctct tccccgacgc cgccgaacgc tccctcccgc tccaagtcga tttctacaag 
301 gttctagggg cagagccaca tttccttggc gatggcatca ggagggcgtt cgaggcacgg 
361 atagccaagc caccgcagta tggctacagc acggatgctc ttgttggtcg tcgacaaatg 
421 ctgcagattg cccatgacac tctcatgaac cagaactccc gcactcagta tgatcgtgcg 
481 ctttctgaga accgtgaaga agctctcacc atggatattg cttgggacaa ggaggctggg 
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541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 



gaggcacttg 
ttcaagcagg 
atggcagcaa 
ctcttgcagg 
actctcgagg 
gagcatcata 
ggcagaggag 
gcttttttga 
attcctcctg 
ataagtaaaa 
ttcaacatag 
ttctgctcat 
gagtcttcac 
agtgaagaga 
gaggtttttc 
gatgatccag 
gctgctgctg 
aaatcaaatg 
tcagccatgg 
aatgcacctg 
tttgcactgt 
attaggagtg 
gcactagatg 
gttcgcaagt 
ttgcaagagg 
gagcgtcatg 
atctccctag 
actgatgtta 
gagatggcct 
tag 



ctgtgcttgt 
acgtggtgct 
gccctccaga 
aagatggagc 
agattacacc 
agaagcgcca 
gtattgctac 
ggatgacatc 
aatggtttga 
ggccacaatt 
gttctcatta 
tgctagtcgg 
catacagaga 
atgatcttct 
ctaggagcag 
aagttttaag 
ctgctattgc 
ctattcaagc 
aaaatactaa 
ctcatgattc 
tggcagtaat 
agcatggatc 
aagatccagt 
ggcagagtat 
ttcttgatgg 
ggtggttctg 
atggtcgacg 
ctgagcccag 
tctccaagct 



aactggagaa 
agcgatggct 
tgtaattggc 
aagcaatctc 
tcgctgtgta 
agaagggctt 
cgttggagga 
aattgaacag 
aatttacaat 
catcatgatg 
tgcttatgat 
agatgttagc 
ccccaaaatt 
tccagggctg 
agatactcgg 
ctacctagaa 
aaaacttggt 
gttcaacaag 
agatggccct 
gagaaatgcc 
tggggccaaa 
tgtggcagtt 
acatattcct 
caaatctaag 
caacatgcta 
ggagtataca 
agcgactgtg 
aaacaatgat 
aggagggtgg 



cagttgcttc 
ctggcttatg 
tgctgcgagg 
gcacctgatc 
ttggagcttc 
caaggtgcga 
ggattttctc 
atggatttct 
gtagcacttg 
gcggatgatc 
aatgagatgg 
aagtgcagaa 
ctagagttta 
tgcaagcttt 
ggcatgcagt 
aggatggagg 
gctcaagcta 
gtttttccat 
gggggatatc 
gccttgaaga 
tatttgcctc 
gctaatagtg 
agaatggatg 
gccttgggac 
aaggtgtgga 
ctatccgatg 
gaggctacga 
tcatatgaca 
aagataacgg 



tggatcggcc 
tggatctatc 
tgctcgagag 
tgctttcaca 
tctcccttcc 
gaaacatttt 
gtgaagcctt 
tttcaaaaac 
cacatgtcgc 
tttttgaaca 
accttgcatt 
tgtggcttgg 
ttgtgaccaa 
tggagacttg 
tcagacttgg 
gtggtggtgc 
cagctgcact 
tgatagaaca 
ttgaaaattt 
ttatctctgc 
gtaagaggcc 
tcgactctac 
cgaagctggc 
cagaacattc 
ctgaccgagc 
tgacgattga 
ttgatgaggc 
caaaatacac 
aaggagcagt 



acccaagcgc 
aagggatgct 
ggctctcaag 
gattgatgaa 
tattgacaca 
gtggagcgtt 
catgaacgag 
accgaatagc 
tcaagcaatt 
actccagaag 
ggaaagggca 
aattgataat 
ctctagcatc 
gcttatcttt 
agattactac 
ttctcatttg 
tggtactgtg 
gttagacagg 
tgaccaggaa 
tggcgcactg 
cctttctgct 
tgatgatcct 
agaagatatt 
ggttgcatca 
agcggagatt 
tagcatcact 
aggccaactt 
tacccggtat 
cctcaagtcg 
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ACCESSION BAB10489 
SEQ ID NO: 127: 

1 mealshvgig Ispfqlcrlp pattklrrsh ntstticsas kwadrllsdf nftsdsssss 
61 fatatttatl vspppsidrp erhvpipidf yqvlgaqthf ltdgirrafe arvskppqfg 
121 fsddalisrr qilqaacetl snprsrreyn egllddeeat vitdvpwdkv pgalcvlqeg 
181 geteivlrvg eallkerlpk sfkqdwlvm alafldvsrd amaldppdfi tgyefveeal 
241 kllqeegass lapdlraqid etleeitpry vlellglplg ddyaakrlng Isgvrnilws 
3 01 vggggasalv ggltrekfmn eaflrmtaae qvdlfvatps nipaesfevy evalalvaqa 
361 figkkphllq dadkqfqqlq qakvmameip amlydtrnnw eidfglergl calligkvde 
421 crmwlgldse dsqyrnpaiv efvlensnrd dnddlpglck lletwlagw fprfrdtkdk 
481 kfklgdyydd pmvlsylerv ewqgsplaa aaamarigae hvkasamqal qkvfpsrytd 
541 rnsaepkdvq etvfsvdpvg nnvgrdgepg vfiaeavrps enfetndyai ragvsessvd 
601 ettvemsvad mlkeasvkil aagvaiglis Ifsqkyflks sssfqrkdmv ssmesdvati 
661 gsvraddsea Iprmdartae nivskwqkik slafgpdhri emlpevldgr mlkiwtdraa 
721 etaqlglvyd ytllklsvds vtvsadgtra Iveatleesa clsdlvhpen natdvrtytt 
7 81 ryevfwsksg wkitegsvla s 

// 



"Replacement Sheet" 
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NM_123613.1| (SEQ ID NO: 128) : 

ATGGAAGCTCTGAGTCACGTCGGCATTGGTCTCTCCCCATTCCAATTATGCCGATTACCACCGGCGACGA 

CAAAGCTCCGACGTAGCCACAACACCTCTACAACTATCTGCTCCGCCAGCAAATGGGCCGACCGTCTTCT 

CTCCGACTTCAATTTCACCTCCGATTCCTCCTCCTCCTCCTTCGCCACCGCCACCACCACCGCCACTCTC 

GTCTCTCCGCCACCATCTATTGATCGTCCCGAACGCCACGTCCCCATCCCCATTGATTTCTACCAGGTAT 

TAGGAGCTCAAACACATTTCTTAACCGATGGAATCAGAAGAGCATTCGAAGCTAGGGTTTCGAAACCGCC 

GCAATTCGGTTTCAGCGACGACGCTTTAATCAGCCGGAGACAGATTCTTCAAGCTGCTTGCGAAACTCTG 

TCTAATCCTCGGTCTAGAAGAGAGTACAATGAAGGTCTTCTTGATGATGAAGAAGCTACAGTCATCACTG 

ATGTTCCTTGGGATAAGGTTCCTGGTGCTCTCTGTGTATTGCAAGAAGGTGGTGAGACTGAGATAGTTCT 

TCGGGTTGGTGAGGCTCTGCTTAAGGAGAGGTTGCCTAAGTCGTTTAAGCAAGATGTGGTTTTAGTTATG 

GCGCTTGCGTTTCTCGATGTCTCGAGGGATGCTATGGCATTGGATCCACCTGATTTTATAACTGGTTATG 

AGTTTGTTGAGGAAGCTTTGAAGCTTTTACAGGAGGAAGGAGCAAGTAGCCTTGCACCGGATTTACGTGC 

ACAAATTGATGAGACTTTGGAAGAGATCACTCCGCGTTATGTCTTGGAGCTACTTGGeTTACCGGTTGGT 

GATGATTACGCTGCGAAAAGACTAAATGGTTTAAGCGGTGTGCGGAATATTTTGTGGTCTGTTGGAGGAG 

GTGGAGCATCAGCTCTTGTTGGGGGTTTGACCCGTGAGAAGTTTATGAATGAGGCGTTTTTACGAATGAC 

AGCTGCTGAGCAGGTTGATCTTTTTGTAGCTACCCCAAGCAATATTCCAGCAGAGTCATTTGAAGTTTAC 

GAAGTTGCACTTGCTCTTGTGGCTCAAGCTTTTATTGGTAAGAAGCCACACCTTTTACAGGATGCTGATA 

AGCAATTCCAGCAACTTCAGCAGGCTAAGGTAATGGCTATGGAGATTCCTGCGATGTTGTATGATACACG 

GAATAATTGGGAGATAGACTTCGGTCTAGAAAGGGGACTCTGTGCACTGCTTATAGGCAAAGTTGATGAA 

TGCCGTATGTGGTTGGGCTTAGACAGTGAGGATTCACAATATAGGAATCCAGCTATTGTGGAGTTTGTTT 

TGGAGAATTCAAATCGTGATGACAATGATGATCTCCCTGGACTATGCAAATTGTTGGAAACCTGGTTGGC 

AGGGGTTGTCTTTCCTAGGTTCAGAGACACCAAAGATAAAAAATTTAAACTCGGGGACTACTATGATGAT 

CCTATGGTTTTGAGTTACTTGGAAAGAGTGGAGGTAGTTCAGGGTTCTCCTTTAGCTGCTGCTGCAGCTA 

TGGCAAGGATTGGAGCCGAGCATGTGAAAGCTAGTGCTATGCAGGCACTGCAGAAAGTTTTTCCTTCCCG 

CTATACAGATAGAAACTCGGCTGAACCCAAGGATGTGCAAGAGACAGTGTTTAGTGTAGATCCTGTTGGT 

AACAATGTAGGCCGTGATGGTGAGCCTGGTGTCTTTATTGCAGAAGCTGTAAGACCCTCTGAAAACTTTG 

AAACTAATGATTATGCAATTCGAGCTGGGGTCTCAGAGAGTAGCGTTGATGAAACTACTGTTGAAATGTC 

CGTTGCTGATATGTTAAAGGAGGCAAGTGTGAAGATCCTAGCTGCTGGTGTGGCAATTGGACTGATTTCA 

CTGTTCAGCCAGAAGTATTTTCTTAAAAGCAGCTCATCTTTTCAACGCAAGGATATGGTTTCTTCTATGG 

AATCTGATGTCGCTACCATAGGGTCAGTCAGAGCTGACGATTCAGAAGCACTTCCCAGAATGGATGCTAG 

GACTGCAGAGAATATAGTATCCAAGTGGCAGAAGATTAAGTCTCTGGCTTTTGGGCCTGATCACCGCATA 

GAAATGTTACCAGAGGTTTTGGATGGGCGAATGCTGAAGATTTGGACTGACAGAGCAGCTGAAACTGCGC 

AGCTTGGGTTGGTTTATGATTATACACTGTTGAAACTATCTGTTGACAGTGTGACAGTCTCAGCAGATGG 

AACCCGTGCTCTGGTGGAAGCAACTCTGGAGGAGTCTGCTTGTCTATCTGATTTGGTTCATCCAGAAAAC 

AATGCTACTGATGTCAGAACCTACACAACAAGATACGAAGTTTTCTGGTCCAAGTCAGGGTGGAAAATCA 

CTGAAGGCTCTGTTCTTGCATCATAA 

NP_199063.1 (SEQ ID NO:129): 

MEALSHVGIGLSPFQLCRLPPATTKLRRSHNTSTTICSASKWADRLLSDFNFTSD.SSSSSFATATTTATL 

VSPPPSIDRPERHVPIPIDFYQVLGAQTHFLTDGIRRAFEARVSKPPQFGFSDDALISRRQILQAACETL 

SNPRSRREYNEGLLDDEEATVITDVPWDKVPGALCVLQEGGETEIVLRVGEALLKERLPKSFKQDWLVM 

ALAFLDVSRDAMALDPPDFITGYEFVEEALKLLQEEGASSLAPDLRAQIDETLEEITPRYVLELLGLPLG 

DDYAAKRLNGLSGVRNILWSVGGGGASALVGGLTREKFMNEAFLRMTAAEQVDLFVATPSNIPAESFEVY 

EVALALVAQAFIGKKPHLLQDADKQFQQLQQAKVMAMEIPAMLYDTRNNWEIDFGLERGLCALLIGK^ 

CRMWLGLDSEDSQYRNPAIVEFVLENSNRDDNDDLPGLCKLLETWLAGWFPRFRDTKDKKFKLGDYYDD 

PMVLSYLERVEWQGSPLAAAAAMARIGAEHVKASAMQALQKVFPSRYTDRNSAEPKDVQETVFSVDPV 

NNVGRDGEPGVFIAEAVRPSENFETNDYAIRAGVSESSVDETTVEMSVADMLKEASVKILAAGVAIGLIS 

LFSQKYFLKSSSSFQRKDMVSSMESDVATIGSVRADDSEALPRMDARTAENIVSKWQKIKSLAFGPDHRI 

EMLPEVLDGRMLKIWTDRAAETAQLGLVYDYTLLKLSVDSVTVSADGTRALVEATLEESACLSDLVHPEN 

NATDVRTYTTRYEVFWSKSGWKITEGSVLAS 
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AY091075.1 (SEQ ID NO:130): 

GATTTAACTTATACTACTCAAAATCAAAATTCCATAAACCCTAGACGACCAAACAGTCTCTTCAATATGT 

AAAACAGAACAAAGTTTTTGTAGTAGCCTAAAAAGACACTCCCATGGAAGCTCTGAGTCACGTCGGCATT 

GGTCTCTCCCCATTCCAATTATGCCGATTACCACCGGCGACGACAAAGCTCCGACGTAGCCACAACACCT 

CTACAACTATCTGCTCCGCCAGCAAATGGGCCGACCGTCTTCTCTCCGACTTCAATTTCACCTCCGATTC 

CTCCTCCTCCTCCTTCGCCACCGCCACCACCACCGCCACTCTCGTCTCTCCGCCACCATCTATTGATCGT 

CCCGAACGCCACGTCCCCATCCCCATTGATTTCTACCAGGTATTAGGAGCTCAAACACATTTCTTAACCG 

ATGGAATCAGAAGAGCATTCGAAGCTAGGGTTTCGAAACCGCCGCAATTCGGTTTCAGCGACGACGCTTT 

AATCAGCCGGAGACAGATTCTTCAAGCTGCTTGCGAAACTCTGTCTAATCCTCGGTCTAGAAGAGAGTAC 

AATGAAGGTCTTCTTGATGATGAAGAAGCTACAGTCATCACTGATGTTCCTTGGGATAAGGTTCCTGGTG 

CTCTCTGTGTATTGCAAGAAGGTGGTGAGACTGAGATAGTTCTTCGGGTTGGTGAGGCTCTGCTTAAGGA 

GAGGTTGCCTAAGTCGTTTAAGCAAGATGTGGTTTTAGTTATGGCGCTTGCGTTTCTCGATGTCTCGAGG 

GATGCTATGGCATTGGATCCACCTGATTTTATAACTGGTTATGAGT-TTGTTGAGGAAGCTTTGAAGCTTT _ 

TACAGGAGGAAGGAGCAAGTAGCCTTGCACCGGATTTACGTGCACAAATTGATGAGACTTTGGAAGAGAT 

CACTCCGCGTTATGTCTTGGAGCTACTTGGCTTACCGCTTGGTGATGATTACGCTGCGAAAAGACTAAAT 

GGTTTAAGCGGTGTGCGGAATATTTTGTGGTCTGTTGGAGGAGGTGGAGCATCAGCTCTTGTTGGGGGTT 

TGACCCGTGAGAAGTTTATGAATGAGGCGTTTTTACGAATGACAGCTGCTGAGCAGGTTGATCTTTTTGT 

AGCTACCCCAAGCAATATTCCAGCAGAGTCATTTGAAGTTTACGAAGTTGCACTTGCTCTTGTGGCTCAA 

GCTTTTATTGGTAAGAAGCCACACCTTTTACAGGATGCTGATAAGCAATTCCAGCAACTTCAGCAGGCTA 

AGGTAATGGCTATGGAGATTCCTGCGATGTTGTATGATACACGGAATAATTGGGAGATAGACTTCGGTCT 

AGAAAGGGGACTCTGTGCACTGCTTATAGGCAAAGTTGATGAATGCCGTATGTGGTTGGGCTTAGACAGT 

GAGGATTCACAATATAGGAATCCAGCTATTGTGGAGTTTGTTTTGGAGAATTCAAATCGTGATGACAATG 

ATGATCTCCCTGGACTATGCAAATTGTTGGAAACCTGGTTGGCAGGGGTTGTCTTTCCTAGGTTCAGAGA 

CACCAAAGATAAAAAATTTAAACTCGGGGACTACTATGATGATCCTATGGTTTTGAGTTACTTGGAAAGA 

GTGGAGGTAGTTCAGGGTTCTCCTTTAGCTGCTGCTGCAGCTATGGCAAGGATTGGAGCCGAGCATGTGA 

AAGCTAGTGCTATGCAGGCACTGCAGAAAGTTTTTCCTTCCCGCTATACAGATAGAAACTCGGCTGAACC 

CAAGGATGTGCAAGAGACAGTGTTTAGTGTAGATCCTGTTGGTAACAATGTAGGCCGTGATGGTGAGCCT 

GGTGTCTTTATTGCAGAAGCTGTAAGACCCTCTGAAAACTTTGAAACTAATGATTATGCAATTCGAGCTG 

GGGTCTCAGAGAGTAGCGTTGATGAAACTACTGTTGAAATGTCCGTTGCTGATATGTTAAAGGAGGCAAG 

TGTGAAGATCCTAGCTGCTGGTGTGGCAATTGGACTGATTTCACTGTTCAGCCAGAAGTATTTTCTTAAA 

AGCAGCTCATCTTTTCAACGCAAGGATATGGTTTCTTCTATGGAATCTGATGTCGCTACCATAGGGTCAG 

TCAGAGCTGACGATTCAGAAGCACTTCCCAGAATGGATGCTAGGACTGCAGAGAATATAGTATCCAAGTG 

GCAGAAGATTAAGTCTCTGGCTTTTGGGCCTGATCACCGCATAGAAATGTTACCAGAGGTTTTGGATGGG 

CGAATGCTGAAGATTTGGACTGACAGAGCAGCTGAAACTGCGCAGCTTGGGTTGGTTTATGAT.TATACAC 

TGTTGAAACTATCTGTTGACAGTGTGACAGTCTCAGCAGATGGAACCCGTGCTCTGGTGGAAGCAACTCT 

GGAGGAGTCTGCTTGTCTATCTGATTTGGTTCATCCAGAAAACAATGCTACTGATGTCAGAACCTACACA 

ACAAGATACGAAGTTTTCTGGTCCAAGTCAGGGTGGAAAATCACTGAAGGCTCTGTTCTTGCATCATAAT 

ATACTCATATGTAGCATGTCTGAGCTTGCGAGATTCTCTTTGTTTTGTAAATTCTCTCTCTAAGTTAGTG 

TTTATAAATGAACACAAAAAAATTAACGTTCAAAAAAAAAAAAAAAA 
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ACCESSION AAM13895 (SEQ ID N0:131): 

1 mealshvgig Ispfqlcrlp pattklrrsh ntstticsas kwadrllsdf nftsdsssss 

61 fatatttatl vspppsidrp erhvpipidf yqvlgaqthf ltdgirrafe arvskppqfg 

121 fsddalisrr qilqaacetl snprsrreyn egllddeeat vitdvpwdkv pgalcvlqeg 

181 geteivlrvg eallkerlpk sfkqdwlvm alafldvsrd amaldppdfi tgyefveeal 

241 kllqeegass lapdlraqid etleeitpry vlellglplg ddyaakrlng Isgvrnilws 

301 vggggasalv ggltrekfmn eaflrmtaae qvdlfvatps nipaesfevy evalalvaqa 

361 figkkphllq dadkqfqqlq qakvmameip amlydtrnnw eidfglergl calligkvde 

421 crmwlgldse dsqyrnpaiv efvlensnrd dnddlpglck lletwlagw fprfrdtkdk 

481 kfklgdyydd pmvlsylerv ewqgsplaa aaamarigae hvkasamqal qkvfpsrytd 

541 rnsaepkdvq etvfsvdpvg nnvgrdgepg vfiaeavrps enfetndyai ragvsessvd 

601 ettvemsvad mlkeasvkil aagvaiglis Ifsqkyflks sssfqrkdmv ssmesdvati 

_ 661 gsvraddsea _lprmdartae niyskwqkik slafgpdhri emlpevldgr mlkiwtdraa 

721 etaqlglvyd ytllklsvds vtvsadgtra Iveatleesa clsdlvhpen natdvrtytt 
781 ryevfwsksg wkitegsvla s 

// 
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GenBank Acc: AI998415 (SEQ Id NO: 132) : 

ATAAACACTAACTTAGAGAGAGAATTTACAAAACAAAGAGAATCTCGCAAGCTCAGACAT 

GCTACATATGAGTATATTATGATGCAAGAACAGAGCCTTCAGTGATTTTCCACCCTGACT 

TGGACCNGAAAACTTCGTATCTTGTTGTGTAGGTTCTGACATCAGTAGCATTGTTTTCTG 

GATGAACCAAATCAGATAGACAAGCAGACTCCTCCAGAGTTGCTTCCACCAGAGCACGGG 

TTCCATCTGCTGAGACTGTCACACTGTCAACAGATAGTTTCAACAGTGTATAATCATAAA 

CCAACCCAAGCTGCGCAGTTTCAGCTGCTCTGTCAGTCCAAATCTTCAGCATTCGCCCAT 

CCAAAACCTCTGGTAACATTTCTATGCGGTGATCAGGCCCAAAAGCCAGAGACTTAATCT 

TCTGCCACTTGGATACTATATTCTCTGCAGTCCTAGCATCCATTCTGGGAAGTGCTTCTG 

AATCGTCAGCTCTGACTGACCCTATGGTAGCGACATCAGNTTCCATAGAAGAAACCATAT 
NCTTGCGTTGAAAAGATGAGC 

GenBank Acc: AL382914 (SEQ ID NO: 133) : 

CTGGTGTAGCAATTGGACTCATAACTTTAGCTGGTTTGAAGATTTTACCTTCTAAAAATG 

GCTCGCCCGTTCTTCACAAAGTGACTGGTTCAGCAATTGCGTCAGATACTATCAATTTAG 

GTCCTGTAGGAGATGAAGAATTAGGAGAGCAACTACCAAAAATGAGTGCAATGGTTGCAG 

AAGCTCTAGTCCGCAAGTGGCAATATATCACATCCCAAGCTTTTGGACCTGACCATTGCC 

TAGGAAGATTGCAAGAGGTGTTGGACGGCCAAATGTTGAAGATATGGACTGATCG 

GenBank Acc: AL382915 (SEQ ID NO:134): 

CCCAAGCTTTTGGACCTGACCATTGCCTAGGAAGATTGCAAGAGGTGTTGGACGGCGAAA 

TGTTGAAGATATGGACTGATCGAGCAGCTGAGATTGCAGAGCTTGGTTGGTCATATGACT 

ACAACTTGGAGGATCTCAACATCGACAGTGTGACCATATCACAGAATGGGCGGCGTGCAG 

TAGTGGAAACAACTCTCAAAGAGTCTACCCACCTCACTGCTGTTGGTCATCCACAGCATG 

CTACTTCCAACAGCAGAACCTACACAACAAGATATGAAATGTCTTTTTCAGATTCAGGGT 

GGAAAATTATTGAAGGAGCTGTCCTTGAGTCGTAATTAGGTTTTGTAATATGTAATATAT 

GTCAGGTTAGTACACTTCAATATTAACCCCCTCGAGCCTATGCCCACTGTCTTGTATGTA 

CCTGTTGTTTTGTGCATTTTTCAAGCATTTATGTAGTCAGGCTGTAAATAGTTGGAGGGT 

ATTTGATCAAATAATTATCCGGTTAAAAAAAAAAAAAAAAAAAAAAA 
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GenBank Acc: BI268376 (SEQ ID NO:135) : 

CACGCTTCTCCAAAAAACCTAACCGTCTCCATTCCTCCGCCGTCTCCGCCACCAGTAAAT 
GGGCGGAGCGACTCATTTCCGATTTCCAATTCCTCGGCGACACCTCCTCTTCCTCCTCCA 
CCACCACCTCCGCCACAGTCACTCTCACTCCTTCTTACCCTCCTCCGATAGAACGCCACG 
TGTCACTCCCTCTCGACCTGTACAAAATCCTCGGCGCCGAAACGCATTTTCTCGGTGATG 
GTATTCGGAGAGCTTATGAAGCGAAATTCTCGAAGCCTCCTCAGTATGCTTTCAGTAATG 
AAGCTTTGATTAGTCGTCGTCAGATTCTTCAAGCTGCTTGTGAAACCCTAGCTGATCCTG 
CTTCTAGAAGAGAiSTATAATCAAAGCCTCGTCGACGATGAAGACGAAGATGAGGAATCTT^ 
CCATTCTCACTGAAATCCCTTTCGACAAAGTTCCTGGAGCTCTGTGCGTGTTGCAAGAAG 
CTGGAGAGACGGAGTTGGTGCTTCGGATTGGAGGGGGTTTACTGAGAGAGAGGTTACCGA 
AGATGTTTAAGCAAGATGTTGTGTTGGCTATGGCGCTTGCATATGTTGACGTTTCTAGGG 
ATGCTATGGCTTTGTCCCCGCCAGATTTCATTGTTGCTTGTGAGATGCTGGAAAGGGCAT 

AW472683 (SEQ ID NO:136): 

AGCGTTGTGTGTGTTGCAGGAAGCTGGAGAGACGGAGCTTGTGCTTGAGATTGGGCAGGG 

TTTGCTTAGGGAGAGGTTGCCGAAGACGTTTAAGCAGGATGTTGTGTTGGCTATGGCACT 

CGCATTTGTTGACGTGTCAAGGGATGCTTGGCTTGTTCACCGGATTTCATTGCGGCTGTG 
AGATGCT 

BE472035 (SEQ ID NO: 137) : 

GGAAAGCTTCCTTAACAATGGAGGCATTAACACAGCTAAGCTTTGGCATTTGTACTCCAC 
GCCTTTCATCACCATTTCAACTAGCCGCCGCCGGTGGTAAGAAGCCGCCGAGACTCAATG 
CCGTTAACGGAGGAGCTAGTAGTGTTACCGGTGGAACAAGTAGTTTACCTACTAACTTCT 
CCGCTAGTAAATGGGCGGATCGTCTTCTCGCCGATTTCCAATTCCTTCCTTCCACCACCA 
CCTCCGACTCATCGGATTTCCAGAATTCAACTTCTACAACCTCCGTTACGACTATTCCTC 
CTCCTGTTGCTCCTTCAGACCACCACATTTCAATGCCTATAGACTTTTATAGAGTGCTTG 
GTGCTGAAGCTCACTTCCTCGGTGACGGTATTAGGAGATGCTACGATGCTAGAATTACAA 
AGCCTCCGCAGTACGGATACAGTCAGGAAGCATTGATTGGCCGACGGCAGATTCTTCAAG 
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CTGCTTGTGAAACCCTTGCTGACTCTACCTCTCGTAGAGAGTACAATCAAGGCCTCGCTC 

AGCATGAGTTCGATACTATTCTAACTCCTGTCCCCTGGGATAAAGTTCCGGGAGCAATGT 
GTGTTTTG 

BI120337 (SEQ ID NO:138) : 

GAAGATTTCATGAATGAGGCCTTCTTACGTATGACAGCAGCTGAGCAGGTTGATCTGTTC 

GTCACCACGCCAAGTAATATCCCGGCTCAAAATTTTGAAGTTTATGGAGTGGCACTTGCC 

CTTGTTGCCCAAGCTTTCATTGGTAAAAAGCCTCATCTCATCACAGATGCTGATAACCTA 

TTCGGACAGCTTCAGCAGATTAAGGTAACAAATCAAGGGAGTCTTGTTCCTGTCTTTGGT 

TCCATGGAAAACCGTGATATTGACTTTGGGTTGGAGAGGGGCTTTGTTCACTGCTTGTAG 
GCCAGCT 

AI043508 (SEQ ID NO: 139): 

GGGAAACGTGCCTTGGTGGAAGCAACTCTTCAAGAATCAGCGCAGTTAACTGACGTTAAC 

CT^CCTGAGCATAACGATTCTTACAGCAGAACATACACAACAAGGTACGAGATGTTTCAC 

TCCAATGCTGGGTGGAAGATCATAGAGGGAGCTGTCCTCCAATCTTAAGCTGCTGGAAAT 

CCAGTCTTGAATGTACATATTTTCACATCATCTGCACATTATGAATGAAGGATGGTATGT 

GTTTTCTGGACAGTGGTATTTGATCATGTTGTGTTTATTTTGGTAACAAGTTTTGATCAT 

TATCAAAAAGATCACTCTTGTAAGTTAGTTTTTTCCACAATAAATCAACTATTTATATGA 

AAGTTTTTATATCAGGACTACTTGCCTTTACTTATATAAACTTTGAGAAATTTTTT 

AU095068 (SEQ ID NO: 140) : 

TGGTGCTTCTCATTTGGGCTGCTGCTGCTGCTATTGCAAAACTTGGTGCTCAAGCTACAG 

CTGCACTTGGTACTGTGAAATCAAATGCTATTCAAGCGTTCAACAAGGTTTTNCCATTGA 

TAGAACAGTTAGACAGGTCAGCCATGGAAAATACTAAAGATGGCCCTGGGGGATATCTTG 

AAAATTTTGACCAGGAAAATGCACCTGCTCATGATTCGAGAAATGCCGCCTTGAAGATTA 

TCTCTCTGGCGCACTGTTTGCACTGTTGGCAGTAATTGGGGCCAAATATTTGCCTCGTAA 

GAGGCCCCTTTCTGCTATTAGGAGTGAGCATGGATCTGTGGCAGTTGCTAATAGTGTCGA 

CTCTACTGATGATCCTGCACTAGATGAAGATCCAGTACATATTCCTAGAATGGATGCGAA 
GCTGGCAGAAGATATTGTTCGCAAGTGGCAGAGTATCAAATCTAA 
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AU183658 (SEQ ID N0:141): 

ATCATAAGAAGCGCCAAGAAGGGCTTCAAGGTGCGAGAAACATTTTGTGGAGCGTTGGCA 

GAGGAGGTATTGCTACCGTTGGAGGAGGATTTTCTCGTGAAGCCTTCATGAACGAGGCTT 

TTTTGAGGATGACATCAATTGAACAGATGGATTTCTTTTCAAAAACACCGAATAGCATTC 

CTCCTGAATGGTTTGAAATTTACAATGTAGCACTTGCACATGTCGCTCAAGCAATTATAA 

GTAAAAGGCCACAATTCATCATGATGGCGGATGATCTTTTTGAACAACTCCAGAAGTTCC 
ACATAGGTC 

AU058418 (SEQ ID NO: 142) : 

ATCATAAGAAGCGCCAAGAAGGGCTTCAAGGTGCGAGAAACATTTTGTGGAGCGTTGGCA 

GAGGAGGTATTGCTACCGTTGGAGGAGGATTTTCTCGTGAAGCCTTCATGAACGAGGCTT 

TTTTGAGGATGACATCAATTGAACAGATGGATTTCTTTTCAAAAACACCGAATAGCATTC 

CTCCTGAATGGTTTGAAATTTACAATGTAGCACTTGCACATGTCGCTCAAGCAATTATAA 

GTAAAAGGCCACAATTCATCATGATGGCGGATGATCTTTTTGAACAACTCCAGAAGTTCA 
ACATAGGTTCTCATTATGCTTATGATAATGAGATGG 

BE490117 (SEQ ID NO:143): 

CAGTGCTTGCAATTGGAGGGCACTTACTGGAGGACCGCCCGCCCAAGCGGTTCAAGCAGG 

ATGTGGTGCTGGCAATGGCGCTCGCTTATGTGGATCTATCAAGGGACGCAATGGCGGCTA 

GCCCTCCAGATGTAATCCGCTGCTGTGAGGTGCTTGAAAGGGCTCTCAAGCTTTTGCAGG 

AGGATGGGGCAATCAATCTCGCACCTGGTTTGCTCTCACAAATTGATGAAACTCTGGAGG 

ATATCACACCTCGTTGTGTTTTGGAGCTTCTTGCCCTTCCTCTTGATGAAAAACATCAGA 

ATGAACACCAAGAAGGTCTTCGTGGTGTGAGAAACATTTTGTGGAGTGTTGGCAGAGGAG 

GTATTGGTACTGTTGGAGGAGGATTTTCGCGTGAAGCCTACATGAATGAAGCCTTCCTGC 

AGATGACATCGGCGGAGCAGATGGATTTCTTCTCAAAAACACCGAATAGCATACCGCCTG 

AATGGTTTGAAATCTATAGCGTGGCACTTGCT^TGTTGCTCAAGCAATTGTAAGTA 

BG607272 (SEQ ID N0:144): 

ACACCTCGTTGTGTtTTGGAGCTTCTTGCCCTTCCTCTTGATGAAAAGCACCAGAGTAAA 
CGCCAAGAAGGTCTTCGTGGTGTGAGAAACATTTTGTGGAGTGTTGGTAGAGGAGGTATT 
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GCTACTGTTGGAGGAGGATTTTCNCGTGAAGCCTACATGAATGAGGCCTTTTTGCAGATG 

ACATCAGCGGAGCAGATGGATTTCTTTTCAAAAACGCCAAATAGCATACCACCTGAATGG 

TTTGAAATCTATAGTGTGGCACTCGCAAATGTTGCTCAAGCAATTGTAAGTAAAAGGCCA 

NAGCTCATCATGGTGGCAGATGATCTTTTCGAACAGCTCCAGAAGTTCAATATAGGTTCT 

CAATATGCTTATGATAATGAATTGGATCTTGTGTTGGAAAGGGCACTTTGCTCATTGC 

BI949952 (SEQ ID N0:145): 

GCGAGCATGAGTCCGTGGCAGTTGCTAATGTTGTTGACTCAGGTGATGATGACGAACCAG 
ATGAGCCCATACAGATTCCTAAAATGGATGCGAAGCTGGCAGAAGATATTGTTCGCAAGT 
GGCAGAGCATCAAATCCAAGGCCTTGGGATCAGATCATTCTGTTGCATCATTGCAAGAGG 
TTCTTGATGGCAACATGCTGAAGGTATGGACGGACCGAGCAGCAGAGATCGAGCGCAAAG 
GCTGGTTCTGGGACTACACGCTGTCCAACGTGGCGATCGACAGCATCACCGTCTCCCTGG 
ACGGACGGCGGGCGACCGTGGAGGCGACAATTGAGGAGGCGGGTCAGCTCACCGACGCAA 
CCGACCCCAGGAACGATGATTTGTACGACACTAAGTACACCACCCGGTACGAGATGGCCT 
TCACCGGACCAGGAGGGTGGAAGATAACCGAAGGCGCAGTCCTCAAGTCGTCATAGGGCG 

AV833644 (SEQ ID NO: 146) : 

GAAACTCTGGNNGNAGATCACCCCTCGTTGTGTTTTAGAGCTTCTTGCCCTTCCTCTTGA 

CGAGNAAGCACCAGAGTAAACGCCAAGNAAGGTCTTCGTGGTGTGAGAAACATTTTGTGG 

AGTGTTGGTAGAGGAGGTATTGCTACTGTTGGTGGAGGATTTTCACGGGAAGCCTACATG 

AATGAGGCCTTTTTGCAGATGACATCAGCTGAGCAGATGGATTTCTTTTCAAAAACGCCG 

AATAGCATACCACCTGAATGGTTTGAAATCTATAGCGTGGCACTCGCAAATGTTGCTCAA 

GCAATTGTAAGTAAAAGGCCAGAGCTCATCATGGTGGCAGATGATCTTTTCGAACAGCTC 

CAGAAGTTCAATATCGGTTCTCAATATGCTTATGGTAACGAGATGGATCTTGCGTTGGAA 

AGGGCACTTTGCTCATTGCTTGTGGGAGACATTAGCAACTGCAGAACTTGGCTTGCGATT 

GATAATGAATCTTCACCACATAGAGACCCGAAAATTGTAGAGTTTATTGTGAACAACTCT 

AGCATTGACCACCAGGAGAATGATCTTCTTCCAGGCCTGTGTAAGCTTTTGGAGACTTGG 
CTTGTCTCAGAGGTTTTCCCTA 

AV921157 (SEQ ID NO: 147) : 
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TGGCTTCACCTGNAAATCCAGCACTAAGTTTCTCTTATCACCAACCCAAGGATCTCTTCT 

AGCCTAGCAATAATCCGAATAGAACACACCGAAAAACAAAGCTCATCGCTGACTAACTGA 

CTAACCAAACTATCTCCGTCTTCCAAACTGACAAGAGCCTAGACTAGACTGCTTATTTAC 

ACACCAGAAAAACACGGGAGGAATCAATCAACAAGGTTTACTGCACGCTGAACGCCCTAT 

GACGACTTGAGGACTGCGCCTTCGGTTATCTTCCACCCTCCTGGTCCGGTGAAGGCCATC 

TCGTACCGGGTGGTGTACTTAGTGTCGTACAAATCATCGTTCCTGGGGTCGGTTGCGTCG 

GTGAGCTGACCCGCCTCCTCAATTGTCGCCTCCACGGTCGCCCGCCGTCCGTCCAGGGAG 

ACGGTGATGCTGTCGATCGCCACGTTGAACAGCGTGTAGTCCCAGAACCAGCCTTTGCGC 

TCAATCTCTGCTGCTCGGTCTGTCCATACCTTCAGNATGTTGCCATCAAGAACCTCTTGC 

AATGATGCAACAGAATGATCTGATCCCAAGGCCTTGGATTTGATGCTCTGCCACTTGCGA 
ACAA 

BE917942 (SEQ ID NO:148): 

TATGGGTCTGTGGCAGTTGCTGACTCTGTTGATGGTCTGGGAGCAGATGAAGAGCCACTA 

GAAATTCCTAGAATGGATGCAAAGTTGGCTGAAGATATTGTTCGCAAGTGGCAAAGTATC 

AAGTCCAAGGCTTTGGGGCCAGAACACACTGTCACGGCATTGCAAGAGATCCTCGATGGC 

AACATGCTGAAGGTATGGATGGACCGAGCCACAGAGATTGAGCGTCACGGTTGGTTCTGG 

GAATACACACTCTCCGACGTGACGATCGACAGTATCACCGTCTCCATGGACGGTCGACGG 

GCAACTGTGGAGGCGACGATTGAGGAGATGGGCCAACTTACCGACGTAGCAGACCCAAAG 

AACAACGACGCCTACGACACAAAGTACACCGCTCGGTACGAGATGAGCTACTCCAAGTCC 

GGAGGGTGGAGGATCACCGAAGGAGCAGTCCTCAAGTCGTAGAACGGTCGTGCAGCAGGA 

GTAGGCGAGTAGGGGTTGCTCAACTCCCATTCTTTTTTCTTTTGCACCAGTGTATGTAAA 

TAAACAGTGTGAGCACAGGTTCTTTTCTCTCCTGGAGAGAGTTTGGTTAGGTTGATTAGT 

GATGAGTTCCTGAGGCCGAGAGAATTTGTCATCTAGTTTGTATTGATAGAGAT 

BE918523 (SEQ ID N0:149): 

GCACGAGGATAGAACAGCTAGACAGATCAGGCAAGGATACCCCAGGTGATGATCTTGAGA 
AATCTCTTGAAAAACTTGCCCAAGAAATGTTGCTGGAGATGCTATCCATGATTCCAAAAA 
TGCCGCTTTGAAGATTATCTCTGCTGGTGCACTGTTTGCACTATTTGCAGTAATAGGTCT 
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GAAGTGCTTGCCTCGTAAGAAGTCACTTCCTGCTCTTAAGAGCGAATATGGGTCTGTGGC 

AGTTGCTGACTCTGTTGATGGTCTGGGAGCAGATGAAGAGCCACTAGAAATTCCTAGAAT 

GGATGCAAAGTTGGCTGAAGATATTGTTCGCAAGTGGCAAAGTATCAAGTCCAAGGCTTT 

GGGGCCAGAACACACTGTCACGGCATTGCAAGAGATCCTCGATGGCAACATGCTGAAGGT 

ATGGATGGACCGAGCCACAGAGATTGAGCGTCACGGTTGGTTCTGGGAATACACACTCTC 

CGACGTGACGATCGACAGTATCACCGTCTCCATGGACGGTCGACGGGCAACTGTG 

-Biyi498278 (SEQ ID NO: 150) : - - _ ^ 

GCCACAGGCCGCCACCGCCTGGCCCCTCCACCTGCCGCTCCGCCAGCCGCTGGGCCGACC 

GCCTCTTCGCCGACTTCCACCTCCTCCCCGCCGCCGCCGACCCGCCAGCCGCGGCCTCCT 

CTTCCTCCTCGTCCCCGTTCGTCCCGATCTTCCCCGAAGCCGCCGACCGCGCCTTGCCCC 

TCCCGGTCGACTTCTACAAGATTCTTGGTGCGGAGCCACATTTCCTAGGCGATGGCATTC 

GGAGGGCGTTCGAGTCGCGGATAGCTAAGCCACCTCAGTATGGGTACAGCACAGAAGCTC 

TTGCTGGGCGACGGCAAATGCTGCAGATTGCCCATGATACTCTCACAAACCAGAGCTCGC 

GCACCGAGTACGACCGTGCGCTTTCCGAGGACCGTGATGCGGCACTCACCATGGATGTTG 

CCTGGGATAAGGTTCCAGGTGTGCTGCGTGTGCTTCAGGAGGCTGGGGAGGCACAACTG 

BM498757 (SEQ ID NO: 151) : 

AGCAATGTGGGCAAGTGCGACACTATAGATCTCAAACCATTCAGGTGGTATGCTATTCGG 

TGTTTTAGAGAAGAAATCCATCTGCTCAGCTGATGTCATCTGCAAGAAAGCCTCATTCAT 

GAAGGCCTCACGAGAAAATCCTCCTCCAACAGTAGCAATACCACCCCTGCCAACACTCCA 

CAATATGTTTTTTGCACCTTGCAGACCTTCTTGGCGTTTATTTTTATGTTTTTCATCAGT 

AGGAAGAGCAAGAAGCTCCAATACACAACGAGGTGTAATCTCCTCCAAAGTTTCATCAAT 

CTGTGCAAGCAGTTCAGGTGCAAGATTGCTTGCACCATCCTCCTGCAGGAGCTTCAGTGC 

CCTCTCAAGCACCTCACAACAGCAGATTACATCTGGAGGGCTTGCTGCCATAGCATCCCT 
TGATATGTCCACATAAGCCAATGCCA 

AW331058 (SEQ ID NO:152): 

CGCGTCGACGTATAGAGTCTGCATCCATGTTGCCTTGAATGAAGCGTCTGCAAAAGAAGG 
CTCTTTTATCACCAGTCGTGTCAGGAAGCATTTTGAAAATATATCAAAATTTCTTTGGCT 
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GAGTGATAGGCCTAATTCAAATAGCAAAGGAAGTGATAAACACCCAGCGGTTAATGATAT 

TACTGCTGCAGTTTGCAAGCAAAAGATGGATATTCAAGAAGCAGAAACACTTGTAAAACA 

GTGGCAAGACATAAAATCTGAAGCTCTTGGCCCTGACTATCAAACTGACATGCTACCTGA 

GATTCTTGATGGTTCAATGCTCTCTAAGTGGGAAGACTTAGCGTTATTAGCAAAGGACCA 

GTCTTGCTATTGGAGATTTGTGCTGCTAAATCTTAATGTTGTTCGAGCCGAGATAATCTT 

GGATGAAATAGGTGCTGGTGAGGCAGCAGAAATTGATGCTGTACTTGAGGAAGCGGCTGA 

GCTTGTTGACGATTCCCAGCCCAAGAAACCGAGT-TATTACAGCACATATGAAGTTCAGTA 

CGTATTGAGGAGGCAGAATCATGGATCTTGGAAAATCTCCGAGGCTGCTGTCCGGGACCT 

GACGTGATTTCTGCCAACTCGGCAAACGGGCTACACAACCATTGGCGTATAGGCGGC 

BE641509 (SEQ ID NO:153): 

GTGGTGTCTTTGCTCGTGTTCCTGGATACACAAGGGATGAGTATATGAAGGCAGCTTTTT 

CTCGAATGACAGCTGCTGAGCAAGTAGCTTTGTTCACAAATACACCCAGTAATATCCCAG 

CAGAGAGTTCTGAGGTTTACACAGTTGCGCTTGCTCACATAGCAGAGGGATTTGTTGCAA 

AGAAGCCGCAATTGATTCAGGAAGCTGATTCACTCTTTCTTCAGCTTCAGCGAACAAATG 

CCTCATCATCTAGTTTGCTAGTTACTGGTGGTCTACGGCCATTATCAAGTCTGCAGCTTG 

ATTTTGCTTTTGAACGAGCCATGTGCAAACTGCTCCTAGGAGAACTGGATGGTTGTCGTG 

CATGGCTAGGTTTGGATGATACAAACTCTCCATATAGAGACCCTGCAGTGACtGATTTTG 

TTATAGCTAATTCTTTTGGAAGTGAGGAAGGTGATTATTTACCAGGCCTTTGCAAGTTGT 

TGGAAAGTTGGTTGAGGGAAGCGGTGTTTTTCCCCAACCCGTCAACAGAAAAGTGGAGGT 

ACAAGTTGAGGGAGTATTTTTTATGATGCAAGGAGAAAAAAAGCCGCCGTGAATTTTTTC 

GCGGGGGGCGCTATGAAAAAATATATTCAACCTTTTTTTGTTGGGGCGTCGTCTACAAAG 

AATGATGGAGTGTCATTGTTGCTTTTGAGGTGACGAAGGGGCGGCGCTCCTCTTTAAGGG 

ATCGTCCGTGGGGGCGCGCGCTCCCATATCGCCATCTTCGGGACACCTTGTTCGTGGGTC 

AAATGGTGATGTCTTTTTTACCACGAACGTCACATTATTCTTATAATATAAGCGTGCGGC 
AGCACTCTCAGCTTCGACGAAACAGCCTAAA 

BI437111 (SEQ ID NO:154): 

GAGAACGGAAGCTTTAGAAGTGGAGGTTGTCCCCAAAATGGATGCTAGGTTGGCGGAAAT 
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TATGGTTCGAAGATGGCAAGCAGCTAAAGCTCGAGCACTTGGTTCTGCTCATGATATGGC 

GGCTCTTCCTGAGGTGCTGGAGGGCGAGATGCTGAAGAGCTGGACAGACCGTGTTAGTGA 

CGTCAAGAGAAATGGTTGGTTTTGGGAATACACTCTCCTTGGTCTTCACATTGATAGTGT 

AACAGTAAGTGACGATGGGAGGCGAGCAACTGCGGAAGCCACTTTGCAAGAGGCAGCCCG 

CTTGGTGGACCGCAACAACCCTGACCACAATGATTCTTATAGAAGCACTTACACTACGCG 

ATATGACCTCCGGCATGGCATAGATGGTTGGCGAATCAATGGAGGAGCTGTGCTGCGTAC 

TTGATTCTGAGATTTTCATCTCCGGATCATGTTGACTTGTAGGCAGATCGACTAGTTGCA 

ACCCTTGCATGCTACGAATGAGTAGTCTTTTTGGATATTTTGATCCATCATGCAGCTTTG 
A 
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Prochlorococcus marinus sp. MED4 (SEQ ID NO: 155): 

ttggaacttccattagatcactttcgtttaataggcgtaagcccctcagcaacatctgaggaaatattaagggct 
ttcca 

attacgcttggataaaactcctgatgaaggattcacgtacgaggttttaactcaaaggtcggaattgcttcgcct 

tactg 

cagatttgcttacagatccagatagtagaagagattacgaaaatttattactaaatggagcatcaggtttagatt 
tatct 

tccaatagagaggttgcaggattaattctcctttgggaatcgggctcttctaaagaagcctttaaaataacaaga 
aaagc 

attgcaacccccccaaactcctgcattgggtagcagtagagaagctgatcttaccttgttagcggctttaacatc 
tagag 

atgctgcaatacaagagcaagatcaaagatcttactcaaatgctgcagattttttacaagaaggcatacagcttc 

ttcaa " " - - 

agaatgggcaaactaggggaattacggaaaactcttgaggaggacttagtgtcgcttcttccgtatcgaattctt 

gattt 

gttaagtagagatctaaatgattatgactcgcataaaaaaggtttaagtatgctggaaaatttaataatcaaaag 
aggtg 

gattagaaggaaaaaataaatctgaatataatgattttctaaatcagcaagaatttgaatctttctttcaacaaa 
taaag 

ccattcttgactgttcaggatcagatagatttatttttagaattacaaaaaaggggttcaagtgaagcaggattt 

ttagc 

ttttttatctttaacagcaattggttttgcaagaagaaaacctgcaaaattattcgaagctcgaaaaatattaaa 
aaaac 

taaatttatcaggacttgactcaatgccattaataggttgccttgatttgcttttagcagatgttgagcaatcct 
cagca 

aggtttttaagtagttccgatgagaagttaagagattggttgaataattatcctggagaaaaattagaagcaata 
tgtat 

tttttgtaaaaattggttagaaaatgatgttttggttggttatagggatattgatttaaaagaaatcgatttaga 

ctctt 

ggtttgaagatagagaaatccaagaatttattgagcaaatagaaaagaagtcaaatagaactgtgtttaagtctg 
ggcct 

caaaataaacctatttttcaagcccaagaatctttaaaagattcaagtacgggccctgatttaaattcggataat 
tttga 

agaaggccgattacctttgcctggaggagtaagagaagatggtcaagaagttattgaagaaaatatttatacaga 
tgaga 

ttattaaaaacaaatcaatagaattttataagtacgcaatagaaaaaattgctgaattaaaatttgtatttggag 
aagcc 

ttagagaactacagaatatttaataaatcttcctacctaacatatctgtatgcttttttgattttatttgctttt 
ggcct 

aggtgttggatttgtaagaaataatctcaaaaaacccgtgcaggaaaaagaaataattgataactcgttatcgat 
aaatg 

aaaataagaatgtcttttatgaaggtttaaatcaagatgataaaaagaaagttctcgataactcaaaaattattc 
tctca 

gataatgcagaaaaagttattttttcaggtgaagaaataaaaactgcttctccctccttagaaaaaatagaaaat 

ttaat 

taatacatggcttgttaacaaaagtaaatttctagcaggaaaaggtgaaattaatttatcaaagatagttcaaga 
tgatt 

tgattgatagattaaagaaggaaagagaacttgatattcaaaaaggtatctacaaaaatatcaatgctaatatcg 

aaaat 

attgtacttttaactcaaacggcatcaagaatatcagtatcagttgacttaaagtattcagaaaaaatattaaaa 
ataga 

tggggaattgataaatgaaacaactttcactccttttttgaaagttaaatatattttaggtttctcaaataactc 

ctgga 

aattagttgactacattagtggtgtttag 
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PROTEIN (SEQ ID NO: 156) : 

LELPLDHFRLIGVSPSATSEEILRAFQLRLDKTPDEGFTYEVLTQRSELLRLTADLLTDPDSRRDYENLLLNGAS ' 
GLDLS 

SNREVAGLILLWESGSSKEAFKITRKALQPPQTPALGSSREADLTLLAALTSRDAAIQEQDQRSYSNAADFLQEG 
IQLLQ 

RMGKLGELRKTLEEDLVSLLPYRILDLLSRDLNDYDSHKKGLSMLENLIIKRGGLEGKNKSEYNDFLNQQEFESF 
FQQIK 

PFLTVQDQIDLFLELQKRGSSEAGFLAFLSLTAIGFARRKPAKLFEARKILKKLNLSGLDSMPLIGCLDLLLADV 
EQSSA 

RFLSSSDEKLRDWLNNYPGEKLEAICIFCKNWLENDVLVGYRDIDLKEIDLDSWFEDREIQEFIEQIEKKSNRTV 
FKSGP 

QNKPI-FQAQESLKDSSTGPDLNSDNFEEGRLPLPGGVREDGQEVIE_ENIYTDEIIKNKSIEFYKYAIEKIAELKF 
VFGEA 

LENYRIFNKSSYLTYLYAFLILFAFGLGVGFVRNNLKKPVQEKEIIDNSLSINENKNVFYEGLNQDDKKKVLDNS 
KIILS 

DNAEKVIFSGEEIKTASPSLEKIENIilNTWLVNKSKFLAGKGEINLSKIVQDDLIDRLKKERELDIQKGIYKNIN 
ANIEN 

IVLLTQTASRISVSVDLKYSEKILKIDGELINETTFTPFLKVKYILGFSNNSWKLVDYISGV* 
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DRAFT Prochlorococcus marinus sp. (SEQ ID NO: 157) : 
MRNA 

gtggacctgccaatagatcatttccgcttgctgggtgtcagtccttcggcagacagtgaggcgattttgcgggcc 
ttgga 

gttgaggttggatcgctgccctgaccaaggtttcacccatgaggtcttaattcagcgggcagaattgttgcggct 
ttcag 

cagatttgctgactgatccgccacggcgtcaggcctatgagactgccttgttggagctcagtcgtgatcatccag 
gtgag 

accgccggtcttgatgtgtcacctagtagagaggtggcagggctgatcttgctgtttgaagcgaattcttctcat 
gaggt 

ttttcatctcgcctctcagggattgcaaccgccccagtccccgacgctaggtagcgaacgagaagctgacctcgc 
tttgt 

tgttggcactggcctgtcgggetgcagccgctgaggaacaggaacaacggcgttatgaagcagcagcgtctcttc 
tgcat 

gacgggatccagttgctgcagcggatgggcaagctctccgaagagtgccacaagcttgagaacgatttagatgcc 
cttct 

gccctatcgcattctcgacttattgagtcgggatcttggtgatcaggtttctcaccaggaaggactgcgcctact 
tgaca 

actttgtgagccagagaggaggtcttgagggaacggccccatcgcctgcacctggtggtcttgatcagtccgaat 
ttgac 

aacttcttcaagcagatcagaaagtttttaactgttcaggaacaggttgatcttttcctgcgctggcagcaagcc 
ggatc 

agcagatgcgggtttcctgggtgggttggctcttgctgctgttggattttcgcgtcggaagcctgaacgggtgca 
ggaag 

ctcggcagcacttagagaggcttcaactggatggatgcgacccgttgccgatgctgggttgcttggacctcttgc 

tcgga 

gatgtgggccgcgctcaggagcgttttctgcgcagtacagatcctcgagtgaaggactgtcttaacagccaccct 
ggcga 

tgaattggctgctttttgtgagtactgccgctcttggctgcgaggggacgtgcttcccggttatagggatgtgga 
tgctg 

aggccgttgatctagaggcttggtttgctgatcgggatgttcaggcttatgtggagcgcctggaacgcagcgaaa 
atcgt 

gcttcttctttaggtaaggccttctcaggatcgtctgtgaagcaacccttcccttgggcgcctcttgatcccgat 

gggat 

tttgcccctctctcttggtgggcctgatgttggtcaacctgcagctgatcagagctctgatgagtttgccagcga 
tggta 

tggcatggattgatcgtttagcagatctgccacgcccgacgcggccggtgctgatcggttcggttgtctttgcgg 
ccctg 

attgcagcctttgcaggcttcagtttgtttggccaacgtcctcgtacgtcagttagtacggctgctgatcagcct 
caagt 

cacagcacctcctacagccacactgcaagaggaggtcctcatgcctcaagtccctgtcagcgctgtggttgagcc 
gctta 

ctttggagcagccgaatgaggcacagctcaaaggcctgcttcaggcctggctcagcaacaaggcagtcgtgcttg 
ccggt 

ggcaagagtgatgcactgcctgaggtcgcaagagatccattggtgcagcgcgtggcgcaagagcgtgccagggat 

gctgc 

tttagctcagacccagaaggttgtggccagcatcagctctgtagaggtggtgagtcgaacgccgcagcgtattga 
gctga 

atgccgttgtgacctatcgcgatcaacgcgttgatgctgccggcaaggttgttgaccaaacgccccaaaaagatc 
tctcg 

gtgacttacatccttggtcgtgatcccgatcgttggcgcctgcatgaatacatcagcggcaaataa 



PROTEIN (SEQ ID N0:158): 
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VDLPIDHFRLLGVSPSADSEAILRALELRLDRCPDQGFTHEVLIQRAELLRLSADLLTDPPRRQAYETALLELSR 
DHPGE 

TAGLD VS PSREVAGL I LLFEANS SHE VFHLASQGLQPPQS PTLGS EREADLALLLALACRAAAAEEQEQRR YEAA ^ 
ASLLH 

DGIQLLQRMGKLSEECHKLENDLDALLPYRILDLLSRDLGDQVSHQEGLRLLDNFVSQRGGLEGTAPSPAPGGLD 

QSEFD 

NFFKQIRKFLTVQEQVDLFLRWQQAGSADAGFLGGLALAAVGFSRRKPERVQEARQHLERLQLDGCDPLPMLGCL 
DLLLG 

DVGRAQERFLRSTDPRVKDCLNSHPGDELAAFCEYCRSWLRGDVLPGYRDVDAEAVDLEAWFADRDVQAYVERLE 
RSENR 

ASSLGKAFSGSSVKQPFPWAPLDPDGILPLSLGGPDVGQPAADQSSDEFASDGMAWIDRLADLPRPTRPVLIGSV 
VFAAL 

lAAFAGFSLFGQRPRTSVSTAADQPQVTAPPTATLQEEVLMPQVPVSAWEPLTLEQPNEAQLKGLLQAWLSIJKA 

WLAG " " " " - 

GKSDALPEVARDPLVQRVAQERARDAALAQTQKWASISSVEWSRTPQRIELNAWTYRDQRVDAAGKVVDQTP 

QKDLS 

VTYILGRDPDRWRLHEYISGK* 
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Synechococcus sp. PCC7002 {SEQ ID NO:159): 

GTGCGCATTCCGCTCGACTATTACCGCATCCTATGCGTCCCCGCCAAGGCAACCACTGCCCAAATTACCCAAGCC 
TATCGCGATCGCCTCTCCCAATTTCCCCGTCGCGAACATAATGCCTTGGCCATTGAGGCCCGCAACCGGATTATC 
GAGCAAGCCTTTGAGGTGTTATCCCAAACAGAAACCCGCGCCGTCTACGACCATGAGCTGTCGGGCAATATGTTT 
CGTTCCCTCGTCCCCAGCCGTCCGAAACTGCCTTTTCCCGATCGCCCCTCCAGTGACACAGAGTTAGAAGCCCTG 
ACAGCCCACCAACCAACCATTGACATCGCGGAAAAAGATTTACTGGGGGGACTGCTGTTACTCCTCGACCTGGGG 
GAGTACGAATTAGTGCTGAAGTGGGCTGCCCCCTACCTCAAGGGCAAAGGCAAGCTGGTCAAGGAAGGGAAATTT 
GGGGCCGTCGAAATCGTCGAGCAAGAACTACGGCTTTGTTTGGCCCTGGCCCACTGGGAATTGAGCCGGGAACAG 
TGGCTCCAACAACATTATGAACAGGCGGCTCTCTCCGGTCAGAAGAGTCAAGAGCTATTGGTAGATGTGGCACAA 
TTTGCAGACCTCCAACAGGAAATTCAAGGGGATCTCAATCGCCTCAGACCCTATCAAGTTCTAGAACTTCTGGCC 
CTACCCGAATCAGAAACCCAAGAGCGACAACGGGGCTTACAACTGCTCCAGGAAATGTTGAGTGCTCGCGTGGGG 
ATTGATGGCCAGGGGGACGATCAGTCGGGTCTAAGTATTGATGATTTTTTGCGCTTTATCCAGCAGTTACGCA5T 
TATCTAACGGTGCAAGAACACTTGGATCTCTTTGTGGCAGAATCAAAGCGACCTTCGGCGGCAGCGGCCTACCTA 
GCGGTGTATGCTCTCTTGGCTGCTGGGTTTTCGCAACGGAAACCTGACCTGGTCGTGCAAGCCCAGACCCTATTA 
A7VACGCCTCGGCAAACGCCAGGATGTTTTCTTGGAGCAATCAATCTGCGCCTTACTTTTAGGTCAGCCGTCGGAA 
GCCAATCAACTGTTAGAACAAAGTCAGGAACAGGAGGCGATCGCCTACATTCAAGAGCAGTCTGAGGGGGCACCG 
GATCTACTCCCAGGCCTATGTCTCTACGGGGAACAGTGGCTGAAGACAGAGGTTTTTTCCCATTTCCGCGATCTC 
CGGCAACGGCTTGAAGATGGCTCTGTTTCGTTGACGGCTTACTTCGCCGATCCTGAAGTGCAGCAATATCTTGAC 
GATCTCCTCACGGAGGCTGTCCCCACACCCACACCACATCCAGACACAGAAAGTACAGCGGCCCCGTCGGAAAAG 
CCACCGGAAACATTACAGTCAGAAACCGGTGTTTCGCCGCATCCCAGTCGTCCCGCCAAGGTTGATTCCTTTGAG 
GATCTCGTCACTCAAACTCCCGCTACAGTTCCCCCGGCACCGCCTTCTCCTGGTGTAGCACCTGTAACTGCGGCA 
TTAAACCCAGACCCGGAAGCGTCTTCTGCTTCGTCAAAATCAGTTTCGTCAAAAAAGTCTATCGGGCCTTGGGGG 
GCGATCGCCGCTATCGTGGGGAGTGTTTTGCTGGTCGTGGGCCTGGTGCGAATTTTGTCTGGCCTAACTACCCAG 
GAACCCTTACAGGTCACCCTCAACGGTGAGCCACCCCTAACGATCCCCAGCTTAGACACCGCCGAGGCAAATAAT 
AATCCGGAGAATGGAGCGACCGATACAACGACAACGCCTGCGCTCAATGAGGCGATCGCCGCTGAGGTGATTCAA 
ACTTGGTTTGAGAGTAAAGCTAGAGCCTTTGGCCAAGACCGTGATTTGGCGGCTCTAGAAAATATTTTGGCAGAA 
CCGTCCCTGTCCCGCTGGCGCAGTAGTGCCCAGGCCGTCCGCAGCGCTGGTACCTACCGCACCTATGACCACAGT 
TTGACCATTGAAACGGTGAGCTTCAACCCAGACCAACCCAATGTGGCGACCGTTGAGGCCCAGGTGCAGGAAAAG 
GCAGATTATTACCGGGCGAATGGGGAACGCGATCCCGGCCAGTCCTATGATTCTGACCTGCGTGTCCGCTACAGC 
TTGGTGCGCCAAGGCGATCGCTGGTTGATTCGTTCTTCCCAAACCCTGTAA 



Protein: (SEQ ID NO: 160) : 

>Scc_7002_Sequence 1 ORF:57453.. 55303 Frame -2 

MRI PLDYYRILCVPAKATTAQITQAYRDRLSQFPRREHNALAIEARNRI lEQAFEVLSQTETRAVYDHELSGNMF 
RSLVPSRPKLPFPDRPSSDTELEALTAHQPTIDIAEKDLLGGLLLLLDLGEYELVLKWAAPYLKGKGKLVKEGKF 
GAVEIVEQELRLCLALAHWELSREQWLQQHYEQAALSGQKSQELLVDVAQFADLQQEIQGDLNRLRPYQVLELLA 
LPESETQERQRGLQLLQEMLSARVGIDGQGDDQSGLSIDDFLRFIQQLRSYLTVQEQLDLFVAESKRPSAAAAYL 
AVYALLAAGFSQRKPDLWQAQTLLKRLGKRQDVFLEQSICALLLGQPSEANQLLEQSQEQEAIAYIQEQSEGAP 
DLLPGLCLYGEQWLKTEVFSHFRDLRQRLEDGSVSLTAYFADPEVQQYLDDLLTEAVPTPTPHPDTESTAAPSEK 
PPETLQSETGVSPHPSRPAKVDSFEDLVTQTPATVPPAPPSPGVAPVTAALNPDPEASSASSKSVSSKKSIGPWG 
AIAAIVGSVLLWGLVRILSGLTTQEPLQVTLNGEPPLTIPSLDTAEANNNPENGATDTTTTPALNEAIAAEVIQ 
TWFESKARAFGQDRDLAALENILAEPSLSRWRSSAQAWSAGTYRTYDHSLTIETVSFNPDQPNVATVEAQVQEK 
ADYYRANGERDPGQSYDSDLRVRYSLVRQGDRWLIRSSQTL 



"Replacement Sheet" 



FIG. 8 coEtmued 21/40 



ACCESSION AF421196 (SEQ ID NO : 161) : 

1 cttgccgact aaaggctaag catcgccatt ccttagatta aagcagtctg tcggcggcgc 
61 tgtgccggtt aacaccagtc tgtcgctgac agcggtgcct ttctggggct tgcctgtggg 
121 gcgagtaacc gatcgctggg ataagagttg gtgcttctgg ctctcaagaa tagggttttc 
181 cgtcgcgtat tcccgatcac atccccctgt gtctgctacg gagataacgc cgatcactca 
241 acagaattgg taagttgacg gtcaagttgg gatgatgaag tcggctcaag ctggcgatcc 
301 ggatctggtg ggtgttctgt gcgtattcct ctcgattact accgaattct ctgtgttggc 
361 gtgcaagcct cggcagacaa acttgccgaa agctaccgcg atcgcctcaa ccaatcgccc 
421 tcccatgagt tttcagagct ggcattgcag gcgcggcggc aactcctcga agcagcgatt 
481 gctgagctga gtgatcccga acagcgcgat cgctacgatc gccgcttttt tcagggcggt 
541 ctggaagcga ttgaaccaag cctagaactc gaagactggc agcgaattgg agccctgcrtg 
601 atcctgctgg aattggggga atacgatcgc gtttcgcaac tggctgagga actcctgcca 
661 gactacgacg cgagcgcaga agtacgcgat cagttcgcgc ggggtgatat cgccttggcg 
721 atcgcactat cccagcaatc cctcggtcga gaatgccgtc agcagggtct gtacgaacag 
781 gccgcccagc actttggccg cagccagtct gccctagccg atcatcagcg ctttcctgaa 
841 ctgagtcgaa ccctgcacca agaacaagga cagctacggc cctatcgcat tttggagcgg 
901 ttggcccagc ccttgactgc cgatagcgat cgccagcagg gtttgctgtt gttgcaggcg 
961 atgttggacg accggcaggg cattgaaggc cctggggatg atggctcggg gctgaccctt 
1021 gataactttt tgatgtttct ccagcaaatt cgcggctatc tgaccctggc tgaacagcag 
1081 ttgctgtttg aatcggaagc gcgtcggccc tcgccggctg cgagcttttt tgcctgctac 
1141 accctgattg cgcggggctt ttgcgatcac caaccctcgt tgatccatcg cgccagcttg 
12 01 ctcttgcatg aactcaagag ccgcatggat gtgcacatcg aacaggcgat cgccagccta 
1261 ttgctcggac agcccgaaga agctgaggcg ctactcgtcc agagccaaga tgaggaaacc 
1321 ctcagccaaa tccgtgccct agcccaaggg gaagccctga tcgtcggttt gtgccgattc 
1381 acggaaacct ggctagcgac caaggtattt ccggatttcc gcgacctcaa ggaaaggact 
1441 gcgccgctgc agccctactt tgacgacccc gatgtccaga cctatctgga tgcgatcgtg 
1501 gagttgccgt ccgatttgat gccaacgccg ctacccgttg agccgcttga ggtgcgatcg 
1561 tcgttgctgg ccaaggaact gccgacccca gcaacgcctg gtgtagctcc accccctcgc 
1621 cgccgtcgcc gcgatcgctc cgaacgtcct gctcgcacgg ccaaacgctt gcccttgccc 
1681 tggattggtt tgggggttgt ggtggttctc ggcggtggaa caggggtttg ggcttggcga 
1741 tcgcgttcca attccacccc gccgaccccg ccccccgtgg ttcaaacgct gcctgaggcg 
1801 gtacctgccc cttcgcccgc gccagttacc gttgccctcg atcgggctca ggctgaaact 
1861 gtgttgcaaa actggttggc cgctaaagct gcagccttgg ggcctcaata cgatcgcgat 
1921 cgcttagcga cggtgctgac cggtgaggtt ctgcagactt ggcagggttt ttctagccag 
1981 caggccaaca cccagctcac atcacagttc gatcacaagt taaccgtcga ctcagttcag 
2041 ctcagtgacg gtgatcaacg agcagtagtc caagccaagg tcgatgaagt tgagcaggtc 
2101 tatcgaggcg accagctgct cgaaacgcgc cgagatttgg gcttggtgat ccgctaccag 
2161 ctcgtgcgcg agaacaacat ctggaaaatt gcttcgatta gtttggtgcg ctaggaattc 
2221 gcaaggggtg aaccccctgc ggtcttttct gtagatcccc tagagcgatc gcagaatgtt 
2281 cagcgattcc tggatgtgcg cttgggcatt caagagtgaa tcaaaaatgt ggcgcacctt 
2341 gccctctttg tcgatcacat aagtgacgcg acccggaatc acaaacaggg ttttgggcac 
2401 gccataggtt tgacggaggc gatcgcctgc atcgctcagc agttggaagg gcaagttgta 
2461 tttctgggc 
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AF421196_1 (SEQ ID NO;162): 

1 mripldyyri Icvgvqasad klaesyrdrl nqspshefse lalqarrqll eaaiaelsdp 

61 eqrdrydrrf fqggleaiep sleledwqri gallillelg eydrvsqlae ellpdydasa 
121 evrdqfargd ialaialsqq slgrecrqqg lyeqaaqhfg rsqsaladhq rfpelsrtlh 
181 qeqgqlrpyr ilerlaqplt adsdrqqgll llqamlddrq giegpgddgs gltldnflmf 
241 Iqqirgyltl aeqqllfese arrpspaasf facytliarg fcdhqpslih raslllhelk 
301 srmdvhieqa iaslllgqpe eaeallvqsq deetlsqira laqgealivg Icrftetwla 
361 tkvfpdfrdl kertaplqpy fddpdvqtyl daivelpsdl mptplpvepl evrssllake 
421 Iptpatpgva ppprrrrrdr serpartakr Iplpwiglgv wvlgggtgv wawrsrsnst 
481 pptpppwqt Ipeavpapsp apvtvaldra qaetvlqnwl aakaaalgpq ydrdrlatvl 
541 tgevlqtwqg fssqqantql tsqfdhkltv dsvqlsdgdq rawqakvde veqvyrgdql 
601 letrrdlglv iryqlvrenn iwkiasislv r 
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AP003590 (SEQ ID NO:163): 

ATTATGTTGATCACGGTGCAGGGGAAGTACGCTGTGCGAATTCCGCTAGATTACTACCGAATTTTAGGGCTACCG 
TTAGCGGCAAGTGATGAACAACTGCGACAAGCATACAGCGATCGCATTGTCCAATTGCCGCGACGGGAGTATTCT 
CAAGCAGCAATTGCTTCCCGTAAACAACTTATAGAAGAAGCTTACGTGGTTTTATCAGATCCAAAGGAACGCAGC 
AGTTATGACCAGCTGTATCTTGCTCACGCCTACGACCCAGACAACGCGGCTACAACCAAAGTGGCAGTGGAAAAT 
CGTGGGGACAGCAACAATGGTCATTTCGATGTCCAAAGCCTGAGCATCGAAGTTTCCTCCGAGGAATTAATTGGT 
GCTTTATTAATTTTGCAAGAGTTGGGAGAGTATGAACTCGTACTCAAGTTAGGTCGTAATTACTTAGGTAATCAA 
AACGGCACAGCATCCACCAGAAATGGCAATCATCGCACGCCTGAAGAATTTCTCGATAGTTCTGAACGTCCAGAT 
ATTCTCTTGACTGTTGCTTTGGCCTCATTAGAATTAGGGCGGGAACAATGGCAACAAGGCCACTATGAAAACGCT 
GCTTTGTCTTTAGAGACTGGGCAAGAAGTGCTGTTTAGTGAAGGCATCTTCCCCAGCGTCCAGGCAGAAATTCAG 
GCTGATCTTTACAAATTACGCCCTTATAGAATTTTAGAATTACTTGCCTTACCCCAGGAAAAAACCATTGAACGC 
CACCAAGGGeTGGATGTAT-TAGAAAGCATCTTAGAGGATCGCGGTGGCATTGATGGTACAGGCAATGATCAATCA 
GGCTTAAACATTGATGACTTCCTCCGATTCATCCAGCAATTACGCCACCACTTAACAGTGGCTGAACAACATAAG 
TTGTTTGATGGTGAAAGCAAACGCCCTTCGGCTGTGGCTACATACTTAGCTGTTTATGCTTCCATCGCCAGAGGA 
TTCACCCAACGCCAGCCCGCTTTAATTCGTCATGCCAAGCAAATTCTGATGCGTTTGTCTAAGCGGCAAGATGTG 
CATTTAGAGCAGTCCCTGTGTGCGCTATTACTAGGGCAAACTGAAGAAGCCACGCGAGTTTTAGAACTGAGCCAA 
GAATACGAAGCTTTAGCCTTAATTCGAGAAAAATCTCAAGATTCACCCGATTTACTGCCAGGTTTGTGCTTATAT 
GCCGAACAATGGCTGCAAAATGAAGTTTTCCCCCATTTCCGCGATTTGTCCAGACAGCAAGCTTCCCTGAAAGAT 
TACTTTGCTAATCAACAAGTACAAGCGTATTTAGAAGCCTTGCCCAACGACGCGGAAACCACTAATGAATGGGCT 
GTAATTAACCGCCAATCGTTTTCTCAACCCAGGGGCAATTCTTACTCTGGAGGAACGCCAGTCGCCAAACGTCCC 
GTAGGGAAGGCGAACAGGCCAGGAGAAGCGTCCACAAGACCAGTTCCCCAACGTAGTCATCCATCAGAAGTAAAT 
CGGCAGTTTCATCAAAACAGAACCCCTGATCCCGAATTACCAGAAACATCAAACCACAGAAGACCAGAGTCTTCA 
AATTTTACAACTGCTAGAGAAAATATATCGACCACAGATGCTTACACTGACAATTATCCACCAGAGATCCCTGTA 
GAACGCGCCAGCAGACCTGTTCAGCCGGGGGTAAGTGGTTATACCCAATCGACCCCTCCACGGCAAACTCCTAAA 
CGCAGGAGACGCAAGAAGCCACAGGCAGTTGTCAACAGAGGACACAGTATTCATCAGCAACGCCAACCCTCACCT 
AGCACTCTAGGCCGGAAAACAAGATTACTTTGGATAGTTTTGGGTTCTTTGGGTGGGATATTATTGTTCTGGCTG 
ATAGTCTCAACGACTTTTGGGTGGTTAAAGAATGTATTCTTCCCAGCACCATCTTTACAAGGTGAGCAATTATCG 
ATTCAGATTAGTCAACCACCTTTAGAGATTCCTGACAAAAATGCCCAGATACAATCCCCAGAGGTGAGTCTCACA 
GAAGAAACGGCAAGGAAAATAATTGAAAATTGGTTGGCTACCAAAGCTAGTGCTTTAGGCGCTGAACATAAAATT 
GAGAGTTTAAACGAGATTTTAACTGGTTCAGCGTTATCTCAATGGCGGCTAATTGCCTTGCAAGATAAAGCAGAC 
T^TCGTCATCGAGAATACAGTCATAGTGTCAAGGTAGACTCCATCAGTAAATCTGACATAGATCCCAATCGTGCA 
AGTGTGGGGGCTACAGTCAGAGAGTTAACCCAATTTTATGAGAATGGGCAAAAAGGGAAGTCTTCTGACGAAAGA 
TTACGTGTACGCTATGAATTGATTCGACAAGATGATATTTGGCGGATTCAGAGGATGTCAGCCGCTATAAATTAA 

BAB74406 (SEQ ID NO:164): 

1 mlitvqgkya vripldyyri 
61 eeaywlsdp kerssydqly 
121 eligallilq elgeyelvlk 
181 laslelgreq wqqghyenaa 
241 Ipqektierh qgldllqsil 

3 01 fdgeskrpsa vatylavyas 
361 gqteeatrvl elsqeyeala 
421 slkdyfanqq vqaylealpn 

4 81 pgeastrpvp qrshpsevnr 
541 dnyppeipve rasrpvqpgv 
601 tlgrktrllw ivlgslggil 
661 dknaqiqspe vslteetark 
721 dkadnrhrey .shsvkvdsis 
781 lirqddiwri qrmsaain 

// 



Iglplaasde 
lahaydpdna 
Igrnylgnqn 
Isletgqevl 
ddrggidgtg 
iargf tqrqp 
lireksqdsp 
daettnewav 
qfhqnrtpdp 
sgytqstppr 
Ifwlivsttf 
iienwlatka 
ksdidpnras 



qlrqaysdri 
attkvavenr 
gtastrngnh 
f segifpsvq 
ndqsglnidd 
alirhakqil 
dllpglclya 
inrqsf sqpr 
elpetsnhrr 
qtpkrrrrkk 
gwlknvf fpa 
salgaehkie 
vgatvreltq 



vqlprreysq 
gdsnnghf dv 
rtpeef Idss 
aeiqadlykl 
f Irf iqqlrh 
mrlskrqdvh 
eqwlqnevfp 
gnsysggtpv 
pessnf ttar 
pqawnrghs 
pslqgeqlsi 
slneiltgsa 
fyengqkgks 



aaiasrkqli 
qslsievsse 
erpdilltva 
rpyrilella 
hltvaeqhkl 
leqslcalll 
hf rdlsrqqa 
akrpvgkanr 
enisttdayt 
ihqqrqpsps 
qisqppleip 
Isqwrlialq 
sderlrvrye 
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NP_486747 (SEQ ID NO: 165) : 

1 mlitvqgkya vripldyyri Iglplaasde qlrqaysdri vqlprreysq aaiasrkqli 

61 eeaywlsdp kerssydqly lahaydpdna attkvavenr gdsnnghfdv qslsievsse 
121 eligallilq elgeyelvlk Igrnylgnqn gtastrngnh rtpeefldss erpdilltva 

181 laslelgreq wqqghyenaa Isletgqevl fsegifpsvq aeiqadlykl rpyrilella 

241 Ipqektierh qgldllqsil ddrggidgtg ndqsglnidd flrfiqqlrh hltvaeqhkl 

301 fdgeskrpsa vatylavyas iargftqrqp alirhakqil mrlskrqdvh leqslcalll 

361 gqteeatrvl elsqeyeala lireksqdsp dllpglclya eqwlqnevfp hfrdlsrqqa 

421 slkdyfanqq vqaylealpn daettnewav inrqsfsqpr gnsysggtpv akrpvgkanr 

481 pgeastrpvp qrshpsevnr qfhqnrtpdp elpetsnhrr pessnfttar enisttdayt 

541 dnyppeipve rasrpvqpgv sgytqstppr qtpkrrrrkk pqawnrghs ihqqrqpsps 

601 tlgrktrllw ivlgslggil Ifwlivsttf gwlknvffpa pslqgeqlsi qisqpplejLp 

661'dknaqiqspe vslteetark iienwlatka salgaehkie slneiltgsa Isqwrlialq 

721 dkadnrhrey shsvkvdsis ksdidpnras vgatvreltq fyengqkgks sderlrvrye 
781 lirqddiwri qrmsaain 
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DRAFT Nostoc punctiforme analysis files 

Version 31may01 - Contig493 Gene 84 (SEQ ID NO: 166) : 

MRNA 

GTGCGAATTCCGCTAGATTACTACCGAATTTTAGGACTACCGTTAGCGGCAAGTGAAGAACAATTGCGACAGGCA 
TACAG 

CGATCGCATTGTACAATTGCCACGACGTGAGTATTCTCAGGCAGCAATTTCTTCTCGTAAACAACTCATAGAAGA 
AGCTT 

ACGTGGTTTTATCAGATCCAAAACAACGCAGTACCTACGATCAGCTTTATCTTGCCCACGCCTATGACCCTGATA 

ACCTT 

GCTGCTGCCGCAGTAGCACAGGAAAATCGTACAGAAAGCACCAAAAGGGGTAGTGATACCCAGAGTCTTGGTATA 
GAAAT 

TACCCAAGACGAATTAGTTGGCGCTTTATTAATTTTGCAAGAGTTGGGTGAATACGAACTTGTATTGAAACTAGG 
TCGTC 

CGTACCTAGTAAATAAAAATAGTGCTACAAGTTCAAGAAAAAGCAATAACTTAGCAGATGAAGAAATTTATGAAA 
GTGCT 

GAACACCCAGATGTCGTTCTCACTGTTGCTCTTGCCTGTCTAGAATTAGGTCGGGAACAGTGGCAGCAAGGTCAC 
TACGA 

AAATGCCGCCATATCCCTAGAAACTGGTCAAGAGCTGCTAGTACGTGAAGGTTTGTTCTCCAGTATCCAGGCAGA 
AATTC 

AGGCTGATCTTTACAAATTGCGGCCATATCGAATTTTGGAGTTGCTCGCATTACCTCAAGAAAAGACTGCCGAAC 
GAAGC 

CAAGGCTTAGAATTATTGCAAAATCTCTTAGAAGATCGTGGCGGGATTGATGGCACGAACAATGATGAATCGGGT 
TTAAA 

CATAGATGACTTTCTGCGATTTATCCAGCAGTTACGCAACCACTTAACAGTTGCAGAACAGCACAAGTTATTTGA 
AGCTC 

AAAGCAAACGTTCTTCTGCTGTTGCCACTTACTTAGCTGTTTATGCCTTGATAGCGCGAGGATTTGCTCAACGGC 
AACCT 

GCTTTAATTCGTCAAGCAAGACAAATGCTCGTGCGTCTGGGCAAGCGCCAAGATGTACATTTAGAACAGTCGCTA 
TGTGC 

CTTACTTTTGGGGCAAACTGAAGAAGCAACTCGTGTTTTAGAACTTAGTCAGGAGTACGAAGCTTTAGCTTTTAT 
TCGGG 

AAAAATCTCAGGACTCTCCAGATTTGTTACCGGGTCTGTGTTTATATGCAGAACAGTGGCTGCAACACGAAGTCT 
TTCCC 

CATTTTCGAGATTTAGCAAACCAGCAAGCTTTCCTAAAAGATTACTTTGCTAACCAACAGGTGCAAGCTTATTTA 
GAAGC 

ACTGCCAACTGATGCCCAAACAACTAATGAATGGGCTGTAATTAACCCCCAGTATTTTCCCCAGGCCAAGGCAAA 
GAATA 

CTCATTTTCATAACAATTCAACTAAAACTTCAGCGTCATTTAATCACAGCAGAGTACCTAACCCAGATTTGCCAG 
AAACA 

CCAACAAAAGAAACCTCTGAATATCCAAACTTCTCACCACCTATGTGGAGTTCATCTGGAAGTATAAAATCAGAG 
GTTCC 

TGCTGCTGAAAGGATGAGCAGAGGTACTAATCAGCATTTGAACGGTTCAGCTAAGAGTGCTGCATCTGGTCATAA 
CCAAA 

AGCGTAGGCGGAGAAAACCTACTCCATCTGCTAGCCGAGAGCGTATACCAGATAATCGTCCTCATTCTCGTCGTC 
CCCGA 

AGGCGGCGAACTTTTGCGAACACCATAGAAGGTAAAACACGGCTGGTATGGAGAGTGTTTATTTCTTTGGTGAGC 
ATATT 

AGTTTTTTGGGTATTAGCCAGAACAACTTTTGGATGGTTAAAAAATCTGTTTTTTCCTCAACCTTCTCCGCCTGA 
TCTAC 

AGTTGTTTGTACAAATAAACCAACCACCGTTACCTATTCCCGATCCAAATAGAAAACCAGAATCAGAAGAAGGCC 
CTTTA 

ACAAATGCAGAGGCAGAAGAAGTTATTCACACTTGGTTATCTACCAAAGCCGCAGCTTTAGGGCCCAATCATGAG 
ATTAA 
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TAATTTAGAGCAAATTTTAACTGGTTCAGCTTTATCTCAATGGCGACTGATTGCTCAACAGAATAAGTTAGACAA 
TCGCT 

ACCGCAAGTTCGACCATAGTTTGAAGATAGAATCTGTTGAGAAAATTGGTTTATTTGCAGATCGTGCCGCAGTAG 
AAGCT 

ACGGTCAAAGAAGTGACGCAGTTATATGAAAATAATCAGTTTAAAAACTCTTCTAACGATAAATTAAGAGTTCGG 

TATGA 

CTTGATTCGAGAACGAGGTAAATGGCGTATTCAGAGTACATCTGTTGTAAATCAATTCACCAGATAA 
PROTEIN (SEQ ID NO: 167) : 

VRIPLDYYRILGLPLAASEEQLRQAYSDRIVQLPRREYSQAAISSRKQLIEEAYWLSDPKQRSTYDQLYLAHAY 
DPDNL 

AAAAVAQENRTESTKRGSDTQSLGIEITQDELVGALLILQELGEYELVLKLGRPYLVNKNSATSSRKSNNLADEE 
lYESA 

EHPDWLTVALACLELGREQWQQGHYENAAISIiETGQELLVREGLFSSIQAEIQADLYKLRPYRILELLALPQE 
TAERS 

QGLELLQNLLEDRGGIDGTNNDESGLNIDDFLRPIQQLRNHLTVAEQHKLFEAQSKRSSAVATYLAVYALIARGF 
AQRQP 

ALIRQARQMLVRLGKRQDVHLEQSLCALLLGQTEEATRVLELSQEYEALAFIREKSQDSPDLLPGLCLYAEQWLQ 
HEVFP 

HFRDLANQQAFLKDYFANQQVQAYLEALPTDAQTTNEWAVINPQYFPQAKAKNTHFHNNSTKTSASFNHSRVPNP 
DLPET 

PTKETSEYPNFSPPMWSSSGSIKSEVPAAERMSRGTNQHLNGSAKSAASGHNQKRRRRKPTPSASRERIPDNRPH 
SRRPR 

RRRTFANTIEGKTRLVWRVFISLVSILVFWVLATTTFGWLKNLFFPQPSPPDLQLFVQINQPPLPIPDPNRKPES 
EEGPL 

TNAEAEEVIHTWLSTKAAALGPNHEINNLEQILTGSALSQWRLIAQQNKLDNRYRKFDHSLKIESVEKIGLFADR 
AAVEA 

TVKEVTQLYENNQFKNSSNDKLRVRYDLIRERGKWRIQSTSWNQFTR* 



"Replacement Sheet" 



FIGo 8 coEtmped 27/40 



>Synechocystis sp. strain PCC6803 063999:2314780-2316924 (SEQ ID NO:168): 
GTGTTTATCCCCCTCGACTTTTATCGTATTTTAGGCATTCCTCCCCAGAGTGGTGGGGAA 
ACCATTGAGCAGGCCTACCAAGATCGCCTTTTACAATTACCCCGGCGAGAATTTAGTGAC 
GCCGCAGTTACTCTCCGCAATCAATTACTGGCGATCGCCTATGAAACCCTGAGGGATCCG 
GAAAAACGTCAGGCATACGACCAAGAATGGTGGGGAGCCATGGATGAAGCCCTGGGGGAG 
GCCTTACCCCTCACTACCCCGGAGTTGGAATGTAGCCCAGAGCAAGAAATTGGAGCCCTG 
TTGATCCTGTTGGATTTGGGGGAATACGAACTCGTGGTTAAGTATGGTGAGCCAGTACTC 
CACGATCCCAACCCTCCGGCGGGAGGCCTGCCCCAGGACTATTTGCTTTCGGTAATTTTG 
GCCCACTGGGAACTGAGCCGGGAACGTTGGCAACAACAGCAGTATGAATTTGCCGCCACC 
GCCAGTCTTAAGGCCCTAGCTCGGTTGCAACAGGATAATGACTTCCCCGCCTTGGAAGCA 
GAAATTCGTCAGGAACTATACCGTCTGCGACCCTACCGTATCCTCGAACTTTTGGCTAAG 
GAGGGGCAAGGGGAGGAGCAACGTCAGCAGGGTCTAGCTCTGTTGCAAGCGATGGTGCAG 
GACCGGGGCGGCATTGAAGGTAAGGGGGAAGATTATTCCGGATTGGGAAATGATGACTTT 
- GTAAAATTCATCCACCAACTACGCTGTCACCTCACAGTGGCCGAGCAAAACGCCCTATTT. _ . _ _ 
TTGCCCGAAAGTCAACGGCCATCTTTAGTAGCAAGCTATTTGGCAGTACATAGTCTGATG 
GCTGAGGGAGTGAAGGAACAGGACCCCATGGCCATTGTCGAAGCAAAATCTTTGATTATA 
CAGTTGGAAAATTGTCAAGATTTGGCCCTAGAAAAGGTAATTTGTGAATTATTATTGGGT 
CAAACGGAAGTTGTTCTGGCGGCGATCGACCAGGGAGATCCGAAAATAGTAGCTGGCCTC 
GAATCTAAGTTAGCGACGGGGGAAGACCCCTTAACTGCTTTTTATACTTTCACTGAGCAG 
TGGCTAGAGGAAGAAATTGTCCCCTACTTTAGGGATCTTTCTCCGGAGACCCTTTCCCCC 
AAGGCCTATTTCAATAATCCCTCCGTTCAGCAGTATCTAGAACAACTAGAGCCGGATTCC 
TTCACCACTGACAATTCTTTTGCCTCCCCTGCCCTCCTTAGCACCGCAACGGAATCGGAA 
ACTCCCATGGTACATAGTTCCGCCGCCCTTCCCGATCGCCCTTTGACCTCCACCGTTCCC 
TCACGACGGGGACGCAGTCCAAGACGTTCCCGAGACGATGTTTTCCCCAGCGCCGACAAT 
TCCAGTGGTTTGGCCGTCACCACCCTATCTCCGGCGATCGCCTACGACACCCACTCCTTG 
GGCACCAACGGTATTGGCGGGGATAGCACTAGCAACGGTTTTTCCAGTAACTCCGCCCCA 
GAATCCACCAGTAAACATAAATCTCCCCGGCGACGCAAAAAACGGGTGACCATCAAGCCG 
GTGCGCTTCGGCATTTTTCTGCTTTGCCTAGCAGGCATTGTGGGGGGGGCAACTGCCCTA 
ATTATCAATCGTACTGGCGATCCCCTAGGTGGGTTGCTAGAAGACCCCCTAGATGTTTTC 
CTGGACCAACCTTCAGAATTTATCCCCGATGAAGCCACGAGCCGGAATTTGATTCTCAGT 
CAACCCAACTTCAATCAGCAAGTGGGTCAGATGGTAGTACAAGGCTGGCTTGATAGTAAA 
AAGTTAGCCTTTGGCCAAAACTACGATGTCGGGGCATTGCAGAGTGTTTTAGCCCCCAAT 
CTCCTTGCCCAACAACGGGGTCGGGCCCAACGGGATCAAGCCCAAAAGGTCTATCACCAA 
TACGAACACAAGTTGCAGATTTTAGCCTATCAAGTTAACCCCCAAGACCCCAACCGAGCC 
ACCGTTACTGCCCGGGTAGAAGAAATTAGCCAGCCCTTTACCCTAGGTAATCAACAGCAG 
AAGGGCTCCGCCACCAAAGATGACTTGACTGTGCGCTATCAGCTAGTACGACACCAAGGG 
GTTTGGAAAATTGACCAAATACAAGTGGTAAATGGCCCCCGTTAG 



NP_441990 (SEQ ID NO: 169) : 

1 mfipldfyri Igippqsgge tieqayqdrl Iqlprrefsd aavtlrnqll aiayetlrdp 
61 ekrqaydqew wgamdealge alplttpele cspeqeigal lilldlgeye Iwkygepvl 
121 hdpnppaggl pqdyllsvil ahwelsrerw qqqqyefaat aslkalarlq qdndfpalea 
181 eirqelyrlr pyrilellak egqgeeqrqq glallqamvq drggiegkge dysglgnddf 
241 Ikfihqlrch Itvaeqnalf Ipesqrpslv asylavhslm aegvkeqdpm aiveakslii 
301 qlencqdlal ekvicelllg qtewlaaid qgdpkivagl esklatgedp Itafytfteq 
361 wleeeivpyf rdlspetlsp kayfnnpsvq qyleqlepds fttdnsfasp allstatese 
421 tpmvhssaal pdrpltstvp srrgrsprrs rddvfpsadn ssglavttls paiaydthsl 
481 gtngiggdst sngfssnsap estskhkspr rrkkrvtikp vrfgifllcl agivggatal 
541 iinrtgdplg. glledpldvf Idqpsef ipd eatsrnlils qpnfnqqvgq mwqgwldsk 
601 klafgqnydv galqsvlapn llaqqrgraq rdqaqkvyhq yehklqilay qvnpqdpnra 
661 tvtarveeis qpftlgnqqq kgsatkddlt vryqlvrhqg vwkidqiqw ngpr 



BAA10060 (SEQ ID NO: 170) : 
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1 mfipldfyri Igippqsgge tieqayqdrl Iqlprrefsd aavtlrnqll aiayetlrdp 

61 ekrqaydqew wgamdealge alplttpele cspeqeigal lilldlgeye Iwkygepvl 
121 hdpnppaggl pqdyllsvil ahwelsrerw qqqqyefaat aslkalarlq qdndfpalea 

181 eirqelyrlr pyrilellak egqgeeqrqq glallqamvq drggiegkge dysglgnddf 

241 Ikfihqlrch Itvaeqnalf Ipesqrpslv asylavhslm aegvkeqdpm aiveakslii 

3 01 qlencqdlal ekvicelllg qtewlaaid qgdpkivagl esklatgedp Itafytfteq 

361 wleeeivpyf rdlspetlsp kayfnnpsvq qyleqlepds fttdnsfasp allstatese 

421 tpmvhssaal pdrpltstvp srrgrsprrs rddvfpsadn ssglavttls paiaydthsl 

481 gtngiggdst sngfssnsap estskhkspr rrkkrvtikp vrfgifllcl agivggatal 

541 iinrtgdplg glledpldvf Idqpsefipd eatsrnlils qpnfnqqvgq mwqgwldsk 

601 klafgqnydv galqsvlapn llaqqrgraq rdqaqkvyhq yehklqilay qvnpqdpnra 

661 tvtarveeis qpftlgnqqq kgsatkddlt vryqlvrhqg vwkidqiqw ngpr 



AY074283 (SEQ ID N0:171): 

MPVAYTFPVLPSSCLLCGISNRSTSFWDRPELQISGLLWRSE 

SGEFFGSGLSLRRFQREGRRRLNAAGGGIHWDNAPSRTSSLAASTSTIELPVTCYQL 

IGVSEQAEKDEWKSVINLKKTDAEEGYTMEAAAARQDLLMDVRDKLLFESEYAGNLK 

EKIAPKSPLRIPWAWLPGALCLLQEVGQEKLVLDIGRAALRNLDSKPYIHDIFLSMAL 

AECAIAKAAFEVNKVSQGFEALARAQSFLKSKVTLGKLALLTQIEESLEGLAPPCTLD 

LLGLPRTPENAERRRGAIAALRELLRQGLSVEASCQIQDWPCFLSQAISRLLATEIVD 

LLPWDDLAITRKNKKSLESHNQRWIDFNCFYMVLLGHIAVGFSGKQNETINKAKTIC 

ECLIASEGVDLKFEEAFCSFLLKQGSEAEALEKLKQLESNSDSAVRNSILGKESRSTS 

ATPSLEAWLMESVLANFPDTRGCSPSLANFFRAEKKYPENKKMGSPSIMNHKTNQRPL 

STTQFVNSSQHLYTAVEQLTPTDLQSPWSAKNNDETSASMPSVQLKRNLGVHKNKIW 

DEWLSQSSLIGRVSWALLGCTVFFSLKLSGIRSGRLQSMPISVSARPHSESDSFLWK 

TESGNFRKNLDSVNRNGIVGNIKVLIDMLKMHCGEHPDALYLKSSGQSATSLSHSASE 

LHKRPMDTEEAEELVRQWENVKAEALGPTHQVYSLSEVLDESMLVQWQTLAQTAEAKS 

CYWRFVLLHLEVLQAHIFEDGIAGEAAEIEALLEEAAELVDESQPKNAKYYSTYKIRY 

ILKKQEDGLWKFCQSDIQIQK" 



At3gl9180 (SEQ ID N0:172): 

1 actgtcaaaa ctcaaaagcc ttgagaccaa atttccgatt ttttctcctc tgaagaaatc 

61 caacaaattg taccatgatt ccagcttcac tctacttctt ctagggttcg ttcgttttct 

121 ggagctgttg cgcaatgcca gtagcttaca catttccagt tctcccttct tcttgtctgc 

181 tttgcggaat ctccaatcgc agcaccagct tcgtcgtaga tcgcccggag cttcagatct 

241 caggtctcct cgtcgttcgt tctgaatccg gtgaattctt cggttctggt ttatctttgc 

301 ggcggtttca gcgagaagga cggaggaggt tgaatgctgc tggtggtggt atccatgtcg 

361 tcgacaatgc gccgtctcgt acttcttctc tcgctgcatc tacctctaca atcgaactcc 
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421 cggttacgtg ttaccagctt atcggagttt 
481 agtcggttat aaatttgaaa aaaactgatg 
541 cagctcgcca ggatcttctc atggatgtta 
601 ctggtaacct aaaagaaaag attgctccta 
661 tgcctggtgc tctatgcctt cttcaagagg 
721 gccgggctgc tctcaggaac cttgattcaa 
781 tggcacttgc tgagtgtgca attgccaagg 
841 gatttgaagc tcttgctcgt gctcaaagtt 
901 ttgcattgtt aactcagatt gaggagtcac 
961 atctactggg cctgccacgc acgccagaaa 
1021 cgctacgcga actgctcaga cagggcctta 
1081 ggccatgctt tttgagccag gcaattagca 
1141 ttccatggga tgatttagcc attacacgga 
1201 aaagagttgt tattgatttt aattgtttct 
1261 gattttcagg caagcaaaat gaaacgatta 
1321 tagcatcaga aggtgttgat ctgaaatttg 
1381 agggttccga ggcagaggcc ctggaaaaac 
1441 ccgttcgtaa ttcgatcttg gggaaagagt 
1501 aagcgtggct aatggagtcc gtgcttgcta 
1561 ctttggccaa ttttttccgg gctgaaaaga 
1621 cttcgatcat gaatcataag acgaaccaaa 
1681 cgtcacaaca tctttataca gctgtcgagc 
1741 tggtatcagc caagaataat gatgaaacca 
1801 ggaaccttgg tgtacacaaa aataaaatat 
1861 tcggaagggt atctgttgtt gctttactgg 
1921 caggcattag gtctggtaga ctacagagta 
1981 cagaatcaga ttcttttctg tggaaaacag 
2041 ctgtgaatag aaatggtatc gtgggaaaca 
2101 attgtggcga acatccggat gccctgtatc 
2161 tgtctcattc tgcgtcagaa ctgcataaga 
2221 ttgtgagaca gtgggaaaat gttaaggctg 
2281' gcctttccga agtccttgat gaatccatgc 
2341 cagaggcgaa atcctgttat tggaggttcg 
2401 atatattcga agatggtatt gctggtgagg 
2461 cagcagaatt agttgatgaa tctcagccca 
2521 tccgatatat tctgaagaag caagaagatg 
2581 aaatacagaa gtgaaaatcc cccagaaaaa 
2641 atcaacagta gaacatggga tcatttagct 
2701 ggaaagtctc aggtttgttt ctttattcct 
2761 tcttttgatt tcaatgtgtt tatggataaa 
2821 ttgtaaagcg ttactgatca caaaaaaaaa 



ctgagcaagc tgagaaagac gaggtcgtta 
ctgaagaggg ttatacaatg gaagctgctg 
gggataaact tctttttgaa tcagaatatg 
aatctcctct cagaattccg tgggcatggt 
ttggacaaga aaaacttgtg ctggatattg 
agccatatat tcatgatata ttcttatcta 
ctgctttcga ggttaacaag gtctctcaag 
ttctgaagag taaagttact cttgggaaac 
tagaggggct tgcaccacct tgcacattgg 
atgcagagag gaggcgaggt gcaattgccg 
gtgttgaagc ttcatgtcaa attcaagact 
ggttattggc cacagagatt gtcgatcttc 
aaaataaaaa atcactggaa tcccacaatc 
acatggtgtt acttggtcac atcgctgttg 
ataaagcaaa aacgatatgc gaatgtctca 
aggaagcttt ttgctcattt cttctaaaac 
ttaagcagct ggaatcaaat tcagactctg 
cgagaagtac ttctgctact ccctcactgg 
actttccaga cacaaggggt tgttctccat 
aatatccaga aaacaagaaa atggggtcac 
gaccactttc cacaacacag ttcgtgaact 
agttgacacc aacagatttg cagagcccag 
gtgccagtat gccatctgtt caactgaaga 
gggatgagtg gctctctcaa agcagtttga 
gttgcaccgt gttcttctct ctgaagctat 
tgcctatatc ggtttctgct aggccgcatt 
agtctgggaa tttcagaaaa aaccttgatt 
tcaaagtgct cattgacatg ttaaagatgc 
tgaaaagctc tggtcaatca gctacatcat 
gaccaatgga tacagaagaa gcggaagagc 
aagctcttgg accaacacat caagtttata 
ttgtccagtg gcaaacattg gcacaaacag 
ttctgcttca tcttgaggtt ttgcaagcac 
ctgcagaaat cgaagctctt ctggaggaag 
aaaacgcaaa atattatagc acttacaaga 
gattgtggaa attctgccaa agcgatattc 
aaagctcatc atctaactaa aggttgtagc 
aacggttgtt cttgtttacc taacggtgta 
tagtaaccca caggatttgt ctttgtagat 
caaacttctt gagtattttt tttattatta 
aaaaaaa 



"Replacement Sheet" 



FIG. 8 continued 30/40 



AAL66980 (SEQ ID NO:173): 

1 mpvaytfpvl psscllcgis nrstsfwdr pelqisgllv vrsesgeffg sglslrrfqr 

61 egrrrlnaag ggihwdnap srtsslaast stielpvtcy qligvseqae kdewksvin 

121 Ikktdaeegy tmeaaaarqd llmdvrdkll feseyagnlk ekiapksplr ipwawlpgal 

181 cllqevgqek Ivldigraal rnldskpyih diflsmalae caiakaafev nkvsqgfeal 

241 araqsf Iksk vtlgklallt qieeslegla ppctldllgl prtpenaerr rgaiaalrel 

301 Irqglsveas cqiqdwpcfl sqaisrllat eivdllpwdd laitrknkks leshnqrwi 

361 dfncfymvll ghiavgfsgk qnetinkakt icecliaseg vdlkfeeafc sfllkqgsea 

421 ealeklkqle snsdsavrns ilgkesrsts atpsleawlm esvlanfpdt rgcspslanf 

481 fraekkypen kkmgspsimn hktnqrplst tqfvnssqhl ytaveqltpt dlqspwsak 

541 nndetsasmp svqlkrnlgv hknkiwdewl sqssligrvs wallgctvf fslklsgirs 

601 grlqsmpisv sarphsesds flwktesgnf rknldsvnrn givgnikvli dmlkmhcgeh 

661 pdalylkssg qsatslshsa selhkrpmdt eeaeelvrqw envkaealgp thqvyslsev 

721. Idesmlvqwq .tlaqtaeaks cywrfvllhl evlqahif ed giageaaeie alleeaaelv 

781 desqpknaky ystykiryil kkqedglwkf cqsdiqiqk 
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Second Set 

BJ258222 (SEQ ID NO:174): 

GGCCGTCGGCAAATACTGCAGNTTGCACATGATACTCTCACAAACCAGAGCTCCCGCACC 

GAGTATGACCGCGCGCTCTCTGAGGACCGTGACGCGGCGCTCACACTGGATGTTGCTTGG 

GACAAGGTTCCGGGTGTGCTATGTGCCCTTCAGGAGGCTGGGGAGGCACAGGCAGTGCTT 

GCAATTGGAGAGCACTTACTGGAGGACCGCCCGCCCAAGCGGTTCAAGCAGGATGTGGTG 

CTGGCAATGGCGCTCGCTTATGTGGACATATCAAGGGATGCAATGGCGGCT^ 

GATGTAATCCGCTGCTGTGAGGTGCTTGAAAGGGCTCTCAAGCTCTTGCAGGAGGATGGG 

GCAATCAACCTTGCACCTGGTCTGCTTTCACAAATTGATGAAACTCTGGAGGAGATCACA 

CCTCGTTGTGTTTTGGAGCTTCTTGCCCTTNCTCTTGATGAAAAACATCANATTGAACGC 
CANNAANGNNT 

BQ410206 (SEQIDNO:175): 

AATTGCAGAAGGCATTGTTCGCAAGTGGCAGAACATTAAATCTGAGGCGTTTGGACCTGA 

TCACCGCCTTGATAAATTGCCAGAGGTTCTGGATGGTCAAATGTTGAAGACATGGACAGA 

TCGTGCAGCCGAAATCGCTCAGCTTGGTTGGGTATATGAATATAGTCTACTGAACATGGC 

CATTGACAGTGTTACCCTTTCACTAGATGGCCAGCGAGCTGTAGTCGAAGCTACTCTGGA 

AGAATCCACCTGCTTGACTGATGTTCATCATCCGGAGAACAATGCCTCTAATGTAAACTC 

CTACACCACGAGATATGAGATGTCTTGTTCCAACTCAGGCTGGAAAATCACTGAAGGATC 

TGTCTACAAATCTTAACTATGATGTATAAAGCATAAAAAGCCTGAAAGCTCCAATGTGGT 

TACCAGCTTTGCCTTTTTACGTAGCTATATTTGTTATATTGTTTGAGAAAACAAGAGTTA 

GCGTTTTCCAGTCATGCAAGCAGTTCAAATTAAAAGAGGCAATGCTTNTCATGGANAACN 
AAATG 
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AJ485537 (SEQ ID NO:176): 

GATGAGCCCATACAGATTCCTAAAATGGATGCGAAGCTGGCAGAAGATATTGTTCGCAAG 
TGGCAGAGCATCAAATCCAAGGCCTTGGGATCAGATCATTCTGTTGCATCATTGCAAGAG 
GTTCTTGATGGCAACATGCTGAAGGTATGGACAGACCGAGCAGCAGAGATTGAGCGCAAA 
GGCTGGTTCTGGGACTACACGCTGTTCAACGTGGCGATCGACAGCATCACCGTCTCCCTG 
GACGGACGGCGGGCGACCGTGGAGGCGACAATTGAGGAGGCGGGTCAGCTCACCGACGCA 
ACCGACCGCAGGAACGATGATTTGTACGACACTAAGTACACCACCCGGTACGAGATGGCC 
TTCACCGGACCAGGAGGGTGGAAGATAACCGAAGGCGCAGTCCTCAAGTCGTCATAGGGC 

BJ263824 (SEQ ID NO: 177): 

CTGCAAATCTAGCACTATGTTTCTCTTTATCTCCAGGATCTAGCCTAGCACCAACAATCC 

AAATACAACACAAGAAAAATAAAGCTCTTCGTCGATCACATCAGACTAACGCAACTATCG 

GTCTTCCAAACTAAAAAGGGCCTAGACTGCCTGCTTATTTACACACCCCCAAAAGAAAAC 

TGGAAGGAATTAACAAACTTAATGAGGTTACCGCACACCAACTACCCTAAGACGACTTGA 

GGACCGCGCCTTCCATTATCTTCCACCCTCCTAGTCCGGTGAAGGTCATCTCATACCGGG 

TGGTGTACTTCGTGTCGTACGAGTCGTTGTTCTTGGGGTCGGTTGCGTCGATGAGCTGGC 

CTGCCTCCTCGATCGTTGCCTCCACGGTCGCCCGCCGTCCGTCCAGGGAGACCGTGATGC 

TGTCGATCGCCACGTCAGACAGTGTGTAGTCCCAGAACCAGCCTTTGCGCCCGATCTCCG 

CTGCTCGGTCCGTCCATACCTTCAGCATGTTGCCATCAAGAACCTCTTGCAATGATTCCA 

CAGAATGATCTGATCCCAAGGCCTTGGTTTTGATACTCTGCCACTTGCGAACAATATCTT 
CTGCCA 



BQ410207 (SEQIDNO:178): 



CTGCTTGCCTGACTGGAAAACCCTAACTCTTGTTTTCTCAAACAATTTAACAAATATAGC 
TCCCTAAAAAGGCAAAGCTGGTAACCACATTGGAGCTTTCAGGCTTTTTATGCTTTATAC 
ATCATAGTTAAAATTTGTAGACAGATCCTTCAGTGATTTTCCAACCTGAGTTGGAACAAA 
ACATCTCATATTTCGTGGGGTAGGAGTTTACATTACAGGCATTGTTCTCCGGATGATGAA 
CATTACTCAAGCCGGGGGGTTCTTCCAAAATAACTTCGACTACAGCTCGCTGGCCATTTA 
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ATGAAAGGGTAACACTGTCAATGGCCCTGTTCAGTCAACTTTATTCATATACCCAACCCA 

GCTGACCGATTTCGGCTGCACCAACTGTCCATGTTTTCAACATTTGACCATCCAAAACCT 

TTGGCAATTTATCAAGGGGGGGATCAAGTCCAAACGCCTCAGATTTAATGTTCTGCCACT 
TGCGAACAATGCCTTTTGCAATT 

AJ485539 (SEQ ID NO:179): 

GATGAGCCCATACAGATTCCTAAAATGGATGCGAAGCTGGCAGAAGATATTGTTCGCAAG 
TGGCAGAGCATCAAATCCAAGGCCTTGGGATCAGATCATTCTGTO 

GTTCTTGATGGCAACATGCTGAAGGTATGGACAGACCGAGCAGCAGAGATTGAGCGCAAA 
GGCTGGTTCTGGGACTACACGCTGTTCAACGTGGCGATCGACAGCATCACCGTCTCCCTG 
GACGGACGGCGGGCGACCGTGGAGGCGACAATTGAGGAGGCGGGTCAGCTCACCGACGCA 
ACCGACCCCAGGAACGATGATTTGTACGACACTAAGTACACCACCCGGTACGAGATGGCC 

AJ463103 (SEQ ID NO: 180): 

TGATGGCAACATGCTGAAGGTATGGACAGACCGAGCAGCAGAGATTGAGCGCAAAGGCTG 
GTTCTGGGACTACACGCTGTTCAACGTGGCGATCGACAGCATCACCGTCTCCCTGGACGG 
ACGGCGGGCGACCGTGGAGGCGACAATTGAGGAGGCGGGTCAGCTCACCGACGCAACCGA 
CCCCAGGAACGATGATTTGTACGACACTAAGTACACCACCCGGTACGAGATGGCCTTCAC 
CGGACCAGGAGGGTGGAAGATAACCGAAGGCGCAGTCCTCAAGTCGTCATAGGGCGTTCA 

BQ169059 (SEQIDN0:181): 

TTTTTTTTTTTTTTTTTTTTTTTTTTTTCAGCGGCAAATTCAGCACTATGTTTCTCTTAT 
CCCCAACTCAAAGATCTTCTAAGCTAGCAATAATCCGAAAACGACACAGGGAAAAACAAA 
GCTCATCGCTGATTGCACATCAGACTAACCAAACTATCTCCAACTTCCAAACTGAGAAGG 
GCCTAGACTGCTTATTTACACACCAAAAAGAACACGGGAGGAATCAATCAACAAAGGTCT 
ACTGCACACCGAACGCCCTATGACGACTTGAGGACCGCACCTTCTGTTATCTTCCACCCT 
CCTGGTCCAGTGAAGGTCATCTCGTACCGGGTGGTGTACTTAGTGTCGTACAAATCGTTG 
TTCCTGGGGTCGGTTGCATCGGTAAGCTGGCCTGCCTCCTCAATTGTCGCCTCCACAGTC 
GCCCGTCGTCCGTCCAGGGAGACGGTGATGCTGTCAATCGCCACGTCGGACAGCGTGTAG 
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TCCCAGAACCAGCCTTTGCGCTCGATCTCTGCTGCTCGGTCCCTCCATACCTTCAGCATG 
TTGCCATCA 



BJ482132 (SEQIDNO:182): 

GCGAGNAAGGACGAGNATCGTCAAGTCGGCCATCGAGCTGAGGAAATCGGAGATCGAAGA 

TGGGTACACGGAGGAGGTGTCCACCTGCAGACAGGCTCTGCTGCTGGACGTGAGAGACAA 

GCTTCTCTTTGAACAGGAGTACGCAGGAAGCACCAGGGCCAAGGTTCCGCCCAGATCCTC 

TCTTCATATACCCTGGAGCTGGTTGCCTGCfGCCTTGTGTGTCTTGCAGGAGGTTGGGGA - 

AGAGAAGCTGGTCTTGGACATTGGTCAGGCAGCTCTACGACGCCCTGATTCTAAGCCATA 

TGCTCACGATGTACTTCTTGCAATGGCACTAGCTGAATGCTCCATTGCAAAAGCTAGCTT 

TGAAAAAAGTAAAGTATCTCTTGGCTTTGAGGCTCTAGCACGTGCTCAATATCTTTTGAG 

GAAAAAACCATCTTTAGAGAAGATGCCTCTTCTTGAGCAGATCGAAGAATCACTTGAAGA 

GCTTGCACCAGCTTGCACTCTAGAGGTTTTAAGCCTGCCCCGTACACCTGAAAATTCTGA 
ACGCAGGCGTGGTGCTATTGCAGCTCTCTGTGA 

BQ490457 (SEQ ID NO: 183): 

GCATAACACGGCAAGAAGATGTTGCAGTTAATGGCTTTGGAAATGAGGATGTTACAATGG 

AGCTTGGCCGTGATAACACTTTAGATTATGTGAATTTAGCCAGTTCAAATTTTACTGAAG 

ATAATATCGAGCAAGAATCGGTTACTGAGAAGATAAAAGATTTAGGTGTGAAGGTTATGT 

GTGCCGGTGTGGTGATTGGACTGACAACTTTGGCTGGCATGAAACTTTTGCCTGGCAGAA 

GTGGGTCTGCCATTCCACACAGGCATCTTGGTTCTGCTGTGGCTTCTGATGTCTCCAGTG 

TGGGGCTCTCAGTAAATGAAACTACTGAGGAGAAAGTACCAAAAATGGATGCAAGACTTG 
CAGAAGTTCTAGTTAGAAGATGGCAGAACGTTAAATCACA 

BU046755 (SEQ ID NO: 184) : 

1 gcagttgcaa ttgctggggg ngattcacta cgtgaaaatt tcatgaacga ggccttcttg 

61 catatgactg cagctgagca ggttgattta tttgtagcta cccccagtaa tatcccggca 

121 gaaagctttg aagtttatgg ggtggctctt gcgcttgttg ctcaagcctt tgttggtaaa 

181 aaacctcatc acattcaaga tgctgaaaac ctattccaga aacttcagca gtctaaggta 

241 acagctgtag gacattctct tgacaactat ataaccaaag aaagcagtga gatagacttt 

301 gctttggaga ggggactctg ttcacttctt ctaggggacc ttgatgacag tcgttcgtgg 

3 61 ttgggcctag acagtaatga ttcaccatat agaaatccat ctgttgtaga ctttgtcttg 

421 gagaactcaa aggatgacga tgacaatgac aatgacaatg atcttcctgg actttgcaag 

481 ctattggaga cgtggttgat ggaggtggta ttccccaggt ttagagacac caaagacata 

541 gagttcagac tgggagacta ctatgatgat cctacagtct tgagatactt agaaaggctg 

601 gatggcacta atggttcacc cttagctgct g 

// 
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BU035730 (SEQ ID NO:185) : 

1 cagaaagagg tggctggatt gatgactttg gctggcttga aatttatacc gtcttaaaca 
61 ggctctacta gtactactgc tcgtaaagaa gttgattcgg ctctggcttc agacgtcacc 
121 aatgtggagg attctagggt tgaggatgct gaagacattc ctaaaatgga tgcaagatta 
181 gccgaaggtc tagttcgtaa gtggcagagc ataaaatccc aagcccttgg acctgagcat 
241 tgccactcaa aattatcata ggtattagat ggtgaaatgc acaagatctg gcttcaacgg 
301 gcaaccgaaa ttgctcaacg tggttggttt tgggactaca cgcttttaaa cattaccatt 
361 gacagtgtta ccgtttcact cgatgggcgc ttagctgttg tggaagcaac ccttgaagag 
421 tctgccaagt tgattgattt gacccacccg gaaaacaatg actcctataa tttaacttac 
481 accacacgtt atgagatgtc gtgtgccaag tcatcatgga aaatcacaaa gggggctgtc 
541 ctcaaatcat aacagatgta attctttctc accttttctg tatttatctg ttattagatt 
601 actcagcagt tgaatgatat gtttctccac catttcgatc atgagcg 

BQ977057 (SEQ ID NO: 186): 

1 tgtggtggtt ggattgatga ctttggctgg cttgaaattt acaccgtcca aaagaggctc 

61 tactagtact actgctcgta aagaagttga ttcggctctg gcttcagacg tcaccaatag 

121 gattctaggg ttgaggatgc tgaagacatt cctaaaatgg atgcaagatt agccgagggt 

181 ctagttcgta agtggcagag cataaaatcc caagcccttg gacctgagca ttgccactca 

241 aaattatcag aggtattaga tggtgaaatg cacaagatct ggcttcaacg ggcaaccgaa 

301 attgctcaac gtggttggtt ttgggactac acgcttttaa acattaccat tgacagtgtt 

3 61 accgtctcac tcgatgggcg cttagctgtt gtggaagcaa cccttgaaga gtctgccaag 

421 ttgattgatt tgacccaccc ggaaaacaat gactcctata atttaactta caccacacgt 

481 tatgagatgt cgtgtgccaa gtcttcatgg aaaatcacaa agggggctgt cctcaaatca 

541 taacagatgt aattctttct caccttttct gtatttaact gttattagat tactcagcag 

601 ttgaatgata tgtttctcca ccatatcgat catgagtgta tttggtgctg cc 

BU889000 (SEQ ID NO: 187): 

1 gactgaaaaa ataaaagatg ccagtatcaa aatatgtgtg ctggtgtggc aattggactg 

61 ctgactttag ctggcctgaa gtgttttcct cctaggactg gctccttcat tcgacagaaa 

121 gaaattggtt cggcaatggc atctgacacc atcaatttga attcagcagt agatgaacaa 

181 atttccgagg acttacccag aatggatgca aggggtgcag aggatatagt tcgcaagtgg 

241 caaaacatta aatctcaggc ttttggaact gatcactgcc tggcaaaatt gccagaggtt 

3 01 ttggatagtc agatgttgaa aatatggaca gatcgtgcgg ccgaaattgc acatcttggt 

361 tgggtatacg agtatatgct gttggacctg actattgaca gtgtgactgt atctgtagat 
421 ggcctaaatg ctgtagtaga agcaacactc aaagagtcaa 
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>genie.294 . 6 jGenomic (SEQ ID NO:188): 

ATGAACTCGGCGGAGCACGTCTCTGTTGCCGTGGACTATTACCGAATGCTGCACGTTCCCCGCGTAAGCC 

GCCCTGACGCCATTCGCAAGGCGTATGAGAACCTGGTGAAGCAACCCCCCGCTGCCGCGTACTCTGCGGA 

CACCCTCTTCGCACGCGCGGTGCTACTCAAGGCAGCCGCGGAGTCGCTGACCGACCCGGACCTGCGCCGC 

TCATATGACGCCAAGCTGGCCGCTGGTCACACAGCCCTGCGCGTCAGCCAGCAGGACCTACCCGGAGCCC 

TTGTCGTGCTGCAGGAG GTGAGCCGTGCTCTGGCGACCGCTCAACCCCTTGCGACCGCTAAAACCATCAG 

CACATATAGCACATATAAATTCCCATGGGTTCTGTACTACCGCCCACCCCTCTGAAGGGGGCGAGTATTC 

ATTCTTCACGCATGAGCGCAGACTTTTACCCTATCAAGTCCCGCCCTCGCCCGCCTTCTCTTCCCACAGA 

TCGGCGAGCACCAGTTGGTTCTGGATCTGGGTCTGCGCTGGCTAGAGGTAAACGGCGGCCAGCCCGACGC 

CGGCGACGTGGCCGCTGCCGTGGCCCTGGCCTACTGTGACCGCGCTGGTGAGCGCCTCACCTCCCAGCTG 

CAGCCGCCGCCGGCCTCAGCGCTGCCAGGCCCCGATGGCGCGGCGGTGCCGCACGCGCACGTGGGCGCGG 

TGCTGCCCGCATGCGACGACCTGGACGCAGCGCTGAGCAAGCTCCGGCGGTACGGCATGGCGCAGCAGCT 

GCAGCAGCAGATCGTGGGCGCGCTGCGG GTGAGGGTGGAGCAGGGGCTGGACCGGCAACCGGTCATAGAT 

GTAGACACAGGGATGTAGGCGTCGATGCGAGGGGATGGAAGTATGGGGTCCTGTGAGTGTGAGCCGATGG 

AAGGTATAGATGCTGGGAGCTGGCGCACCCGACCCATGTCATCCAAGGACTTGGCTGATGCATCGCTCAC 

CCCCCGCCTCCAACCCGAATGCCCTCAG GACCTGGCGCCAGAGTACGCGTGCGAGCTGGCCGCCCTGCCG 

CTGGGCGCCGAGACCGCCGCCCGGCGCGCCAAGGGCGTGGCGCTCATGCGCGGTGTGCTGCGCGCCGCCG 

CCACCGTGGCCGCCGCCACAGCCAA GTAGGTGACAAGCACGCAGGAAATCGTGTGCTATATTGCATTGCG 

GTACCTTGCCTTGCATCGCGGAGGCAGTGCTCGAGAATGCGTTTCGTGCGCGTGATCCGTTTGCTCGTCG 

TGCCTTATCCGCCACCCCAG GCCCGAGGCTGCTGCTGACGACAGCGACGACGACGAGGTGGACCCGCGCA 

GTGTGCTGGCGGCCGCCCGCCGCATGCTGACCCGCAGCCGCGACGTGCTCACCTGCAGCGAGCAGG TACA 

GCGCTGCAACCGGGCAGTTATAGATGGATGCAAGTGCGTGGACGCCGAACGTACAGTTTTTGCTGTGTTC 

CCCGCGTGCACCTTAGCCGCTCCTCCTGCAACCCTCACTTGCGACCTCAATGCGTGCACCTTAGCCGCTC 

CTCCTGCAACCCTCAGTTGCGACCTCACGACACACCGTCTGGCTTACCCCTGCCCCCACCCCA GGTGGCC 

CTGCTGCCGGACGCGCTGCGCGGCAGCGGTGTGTCGCCCACCCCGGACGCGCTGTACGACGGCGCCCTGG 

CGCACCTGGTGGACGGCTTCCGCAACGGCTGGCCGCACTCCGTGCACCAG GTGGGGGAGCGCGGTGCCTG 

GATGTCTGGATGGTCACTGGCCGCAAGGCTGTGCGCACCATCGGGTAGAGTGTAACCAAATGATGTGCGC 

GCAATGAAGGGTGAGCAGATTCCAGCCTCCCTCTGTCGGCTGGCGTCCAACTGTGCCAACTGCGCACACA 

CCTGCGCACGCCCCAG GCCGACCAGCTGCTGGCCAAGCTGGAGGCGCAGCAGGCCCGCGCAGCCGCCATG 

CGCCGCGAGCAGTCCGAGCTGGCCGCCGCCGCCGCAGCCCGCCGTGCCATGTACAGCGGTCCCGCCGCCG 

CCCACGGTCCCACCCTGTACACCAACTACAACAACCCTGCCGGCAGCGGCAATggcgcgccgccgccgcc 

gccccgccccATGCCCATGGTGCCCAGGGGCGACGGCCAGCACGCCATGGCGGCGTCTGTGGCGGCGCAT 

GTGCACTCCACGGCGATGGCGGAGCAcgcggcgcgcagcgcggctggcggcgccgccggcgccTCCGATG 

GCGGCGCGCACGCCAACGGCGTGGCTCTAGAGCGGGCCGTGTGCGCCGTCCTGCTGGGTGACTACACCGC 

GGCGGTGGAGCGGCTGGGGCTAGACACGAACGCGGCGGTGGAGCAGGAGCAGCTGCGCGAGTTCGTCCTG 

GTGCGCCGGGGAGGGCCTACTGCAAAACGTGTTGCTCAGGGTCTTGAGATACCGAACACAATGTTTTCGT 

ATACATCTCCCGTCGAGAGAGCTATGCCTCCACCGTCGGCCCGGCTCCACTGCACCCGATGCGGTTGCAG 

GCCCACTCGCCCAACGGCCGCGGCGACCTGCGCCCGGGCCTGAGGGCGCTGGCCACCCGCTGGCTGGAGG 

GCGTGGCGCTGGCGTCCTTCCGCGACACTGCCGGCAGCCCCGTGCCGCCGCTGGAGGCCAGCTGGTTCGC 

GGACCTGCGTGTCGCCTTCTATCTGCAG GTGAGGGGCGGCAGAAGAGAGGGGGGAAAGGGAGGCGAGAAG 

GCGCTTCCGCCGCTGGCGCAACGGGCCATCCTGGTGGAGCACGGCGCTACATCGCATCTGGTCCACCGTC 

TCTGGATGTATAATTCGTGCACTCTTAACCGGCCGCGCAG GTATGGCGGCTGTGCCGCGTGGAGCAGGTG 

CTGGCCGCCGCCCACTTCCTGGCCAACCTGCTGCCCAACATGCTCAAGgccatcgccggcactgccgtca 

aggtcgcagccaacaccgccgtggcagcctcccgcgcgcagcgcctcagcgccaccgtcgcggccagcac 

cgccaccgcctcgtcatcttcctctgccgcccgcggcgctcgtgccggtgccctgagcgctgccaccgcc 

gccgcacacgccgcgcgccgccAGCAGGCGAACGCGGTCGGTGCCAGCATCGTCGGTGCTGACGTGCTGC 

CCCCCACAGCAGTGgccgcggctgccgcggctggcacagcggccgccgccgcagtcaccggccccgccct 

cggccgtggcgctgcagcttccgcctcttcctttgaggagggcgccgctgaggccgctgacctgcgtcgt 

cgctttgtcgccaccagccgcggcgccagcgcggccgtcgGTGCGCCCACAGCACCAGCCGCTATGACTG 

GGCCCCAGCACGGCGCCGCCTGTGCTGCGCAGTCGCACCGGGAGGAGGATGAGGATTCGCACGGCGGCCA 

GGAGGGGGGCGTGCCGCGGCGCATGAGCGAGGCGGACCTGCGTGCGCACCTGGCGGGCCTGGAGAAGGCC 

ATGTGGGACTCGGAGCTGCCGCCGCCGCCGCCATCCCGCGCGCAGAAGGCGCTCACCTACGCCGCAGGAC 

TGGTGAGTTGCTGCGCAGCCTGACGGCCATAGTTGCCGTAGTGCCATAGTGACCGAGCACCGTGATGTTT 

AGGACATGGGCGGAGAAGTGTTAGGACATGAATTGCATCAACGCTGCAAATCTGGTGTATGGTACGCGCG 
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TTCCCTGTCACCAACAAGGCTGTTGACCAAGCTGCTGCTGCCCTTGCACTCTTTCAACGCCCGTCTGCAG 
CTGGCCGTGGTGGTGGCCTTCCTGGTGTCCAGCTTCTTccgccgcaacgacggcgccgcctccgccctgg 
cacccgccgccgtcaccaccgcctccgtggccgTTAGCGCGCAGCCCGCCAAGCCGGGCAAGGCCACCCG 
CTCCGCGCACTGA 

Transcript Sequence [46927:50859] (without introns) 

>genie.294 .6 jTranscript (SEQ ID NO: 189): 

ATGAACTCGGCGGAGCACGTCTCTGTTGCCGTGGACTATTACCGAATGCTGCACGTTCCCCGCGTAAGCC 
GCCCTGACGCCATTCGCAAGGCGTATGAGAACCTGGTGAAGCAACCCCCCGCTGCCGCGTACTCTGCGGA 
CACCCTCTTCQCACGCGCGGTGCTACTCAAGGCAGCCGCGGAGTCGCTGACCGACCCGGACCTGCGCCGC 
TCATATGACGCCAAGCTGGCCGCTGGTCACACAGCCCTGCGCGTCAGCCAGCAGGACCTACCCGGAGCCC 
TTGTCGTGCTGCAGGAGATCGGCGAGCACCAGTTGGTTCTGGATCTGGGTCTGCGCTGGCTAGAGGTAAA 
CGGCGGCCAGCCCGACGCCGGCGACGTGGCCGCTGCCGTGGCCCTGGCCTACTGTGACCGCGCTGGTGAG 
CGCCTCACCTGCGAGCTGCAGCCGCCGCCGGCCTCAGCGCTGCCAGGCCCCGATGGCGCGGCGGTGCCGC 
ACGCGCACGTGGGCGCGGTGCTGCCCGCATGCGACGACCTGGACGCAGCGCTGAGCAAGCTCCGGCGGTA 
CGGCATGGCGCAGCAGCTGCAGCAGCAGATCGTGGGCGCGCTGCGGGACCTGGCGCCAGAGTACGCGTGC 
GAGCTGGCCGCCCTGCCGCTGGGCGCCGAGACCGCCGCCCGGCGCGCCAAGGGCGTGGCGCTCATGCGCG 
GTGTGCTGCGCGCCQCCGCCACCGTGGCCGCCGCCACAGCCAAGCCCGAGGCTGCTGCTGACGACAGCGA 
CGACGACGAGGTGGACCCGCGCAGTGTGCTGGCGGCCGCCCGCCGCATGCTGACCCGCAGCCGCGACGTG 
CTCACCTGCAGCGAGCAGGTGGCCCTGCTGCCGGACGCGCTGCGCGGCAGCGGTGTGTCGCCCACCCCGG 
ACGCGCTGTACGACGGCGCCCTGGCGCACCTGGTGGACGGCTTCCGCAACGGCTGGCCGCACTCCGTGCA 
CCAGGCCGACCAGCTGCTGGCCAAGCTGGAGGCGCAGCAGGCCCGCGCAGCCGCCATGCGCCGCGAGCAG 
TCCGAGCTGGCCGCCGCCGCCGCAGCCCGCCGTGCCATGTACAGCGGTCCCGCCGCCGCCCACGGTCCCA 
CCCTGTACACCAACTACAACAACCCTGCCGGCAGCGGCAATggcgcgccgccgccgccgccccgccccAT 
GCCCATGGTGCCCAGGGGCGACGGCCAGCACGCCATGGCGGCGTCTGTGGCGGCGCATGTGCACTCCACG 
GCGATGGCGGAGCAcgcggcgcgcagcgcggctggcggcgccgccggcgccTCCGATGGCGGCGCGCACG 
CCAACGGCGTGGCTCTAGAGCGGGCCGTGTGCGCCGTCCTGCTGGGTGACTACACCGCGGCGGTGGAGCG 
GCTGGGGCTAGACACGAACGCGGCGGTGGAGCAGGAGCAGCTGCGCGAGTTCGTCCTGGCCCACTCGCCC 
AACGGCCGCGGCGACCTGCGCCCGGGCCTGAGGGCGCTGGCCACCCGCTGGCTGGAGGGCGTGGCGCTGG 
CGTCCTTCCGCGACACTGCCGGCAGCCCCGTGCCGCCGCTGGAGGCCAGCTGGTTCGCGGACCTGCGTGT 
CGCCTTCTATCTGCAGGTATGGCGGCTGTGCCGCGTGGAGCAGGTGCTGGCCGCCGCCCACTTCCTGGCC 
AACCTGCTGCCCAACATGCTCAAGgccatcgccggcactgccgtcaaggtcgcagccaacaccgccgtgg 
cagcctcccgcgcgcagcgcctcagcgccaccgtcgcggccagcaccgccaccgcctcgtcatcttcctc 
tgccgcccgcggcgctcgtgccggtgccctgagcgctgccaccgccgccgcacacgccgcgcgccgccAG 
CAGGCGAACGCGGTCGGTGCCAGCATCGTCGGTGCTGACGTGCTGCCCCCCACAGCAGTGgccqcggctg 
ccgcggctggcacagcggccgccgccgcagtcaccggccccgccctcggccgtggcgctgcagcttccgc 
ctcttcctttgaggagggcgccgctgaggccgctgacctgcgtcgtcgctttgtcgccaccagccgcggc 
gccagcgcggccgtcgGTGCGCCCACAGCACCAGCCGCTATGACTGGGCCCCAGCACGGCGCCGCCTCTG 
CTGCGCAGTCGCACCGGGAGGAGGATGAGGATTCGCACGGCGGCCAGGAGGGGGGCGTGCCGCGGCGCAT 
GAGCGAGGCGGACCTGCGTGCGCACCTGGCGGGCCTGGAGAAGGCCATGTGGGACTCGGAGCTGCCGCCG 
CCGCCGCCATGCCGCGCGCAGAAGGCGCTCACCTACGCCGCAGGACTGCTGGCCGTGGTGGTGGCCTTCC 
TGGTGTCCAGCTTCTTccgccgcaacgacggcgccgcctccgccctggcacccgccgccgtcaccaccgc 
ctccgtggccgTTAGCGCGCAGCCCGCCAAGCCGGGCAAGGCCACCCGCTCCGCGCACTGA 

Protein Sequence 

>genie.294.6 (SEQ ID N0:190): 

MNSAEHVSVAVDYYRMLHVPRVSRPDAIRKAYENLVKQPPAAAYSADTLFARAVLLKAAAESLTDPDLRR 
SYDAKLAAGHTALRVSQQDLPGALWLQEIGEHQLVLDLGLRWLEVNGGQPDAGDVAAAVALAYCDRAGE 
RLTSQLQPPPASALPGPDGAAVPHAHVGAVLPACDDLDAALSKLRRYGMAQQLQQQIVGALRDLAPEYAC 
ELAALPLGAETAARRAKGVALMRGVLRAAATVAAATAKPEAAADDSDDDEVDPRSVLAAARRMLTRSRDV 
LTCSEQVALLPDALRGSGVSPTPDALYDGALAHLVDGFRNGWPHSVHQADQLLAKLEAQQARAAAMRREQ 
SELAAAAAARRAMYSGPAAAHGPTLYTNYNNPAGSGNGAPPPPPRPMPMVPRGDGQHAMAASVAAHVHST 
AMAEHAARSAAGGAAGASDGGAHANGVALERAVCAVLLGDYTAAVERLGLDTNAAVEQEQLREFVLAHSP 
NGRGDLRPGLRALATRWLEGVALASFRDTAGSPVPPLEASWFADLRVAFYLQWRLCRVEQVLAAAHFLA 
NLLPNMLKAIAGTAVKVAANTAVAASRAQRLSATVAASTATASSSSSAARGARAGALSAATAAAHAARRQ 
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QANAVGASIVGADVLPPTAVAAAAAAGTAAAAAVTGPALGRGAAASASSFEEGAAEAADLRRRFVATSRG 
ASAAVGAPTAPAAMTGPQHGAASAAQSHREEDEDSHGGQEGGVPRRMSEADLRAHLAGLEKAMWDSELPP 
PPPSRAQKALTYAAGLLAVWAFLVSSFFRRNDGAASALAPAAVTTASVAVSAQPAKPGKATRSAH* 



Thermosynechococcus elongatus BP-1 tlr0758 (SEQ ID NO: 191) : 

GTGCGCATTCCCCTCGATTATTACCAAGTGTTGGGTGTGCCTATTCAGGCAACGCCGGAG 
CAAATTGAGCAGGCCTTTCGGGACCGGCTGTTGCAGCTCCCTACCCATCAGCACTCCCCC 
ACCACAGTTGCCACCCGTCGCGAACTCATTGAGCAGGCCTATGCAGTTTTGCGAGAACCG 
GAGCAGCGCGATGCCTACGATCGCCACTGCCGTACCGTTGATCCCGATGATTTGATTGCC 
CAGTTGGATCCCGATGCCACCACTCCCCACATTGAAATTAGTGATGAGCAATTGTCGGGG 
GCACTCCTACTGCTGTATGAACTAGGA7UVTTATGCCCAAGTTGTCAACCTGGGAGACGCC 
TTTCTTAAAAAGGATGTTTTTGAGCGCAATCGCCCCTACACTTCCCCTGCCGCCGTTGCC 
GACATTACCeTCACTGTGGCT-TTGGCCTATCTGGAATTGGGACGGGAGGAATGGCAGCGG 
CAGTCCTATGAATCAGCCGCCTCTCAGCTAGAAGCCGGTCTCCAGGTACTTCAGCGGGTA 
AATTTGTTTCCCGAGCTCCAGGAGCAGTTTCAGACGGAACTGAATCGGCTGCGTCCCTAC 
CGCATTCTGGAATTACTGGCACTGCCTTTGTCCGATAGTGCGAATCGGCAGCGGGGTATT 
TTATTGCTGCGGCAAATGCTGAGTGAGCGCGGGGGCATTGAGGGGCGCGGTGACGATCGC 
TCAGGACTAACAGTTGAGGATTTTCTGAAATTTATTTTGCAACTGCGCAGCCATCTTACC 
GTGGCAGAACAACAGGAACTCTTTGAACGGGAATCGCGGCGTCCCTCAGCGGTGGCCACC 
TACCTTGCGGTACATGCCTTGGTAGCACGGGGAGTGCATGAACTGCAGCCGAGCTATATT 
TGTCGGGCCAAGGATTTATTGCAGCAGCTGCTCCCCCATCAAGACGTCTATCTTGAACTT 
GCCAGTTGCTTGCTGCTTTTGGGACAGCCCACCGAGGCCTTGGCAGCTCTTGACCACAGC 
CAAGATCAACCGACTCTGGACTTTATCCGCCGTCATGCCGGTGAGGCTGGCGATCGACTG 
CCGGGGCTTTATTACTACAC.CACACAATGGCTCACGGAGGAAATTTATCCTGCATTTCGG 
GACTTGGGGGAAACACCCGTGGCCTTGGAGGCTTACTTTGCTGATGCCAATGTCCAAACC 
TATCTAGAGGCTCTCAGTGAGGACTCCATTGCCCCTGAACCCCCTGCGACCACTGCCTCT 
GCGCTCCCTGAAGTGATCAGACCAACGGTGGCCGTGCCCCCTCCCCTCTCCTTCACAGCG 
GAAACGTTACCGTTGCAGGATCAGAGTCGGCTGGGTCAGGGCCTTTCGGCATCGGCTTTT 
ACCCCTTCTGCAACTGCAACGGGGACATCGATGCCCCAACCATCGCCTCGCAAACGGCGC 
AGCCCTCGAAACCGTTGCGCCCAAAAACGTCAGACTTGGTTTTGGATGGGTGCAGGAGTG 
GTTCTTGTGGGTTTAGGGGCGTTGGCAAAAGTCTATTGGCCCGCCAAAACCGCTGAAGCC 
CCCCCGCCGCCGGTGACACCGGCACCAACTCCTGTGGCAACGCCGACCCCAACGCCACAA 
CCGACGACCTTAGCCATCACTTTAACACCAGAGATGGCGCGCGATCGCCTCCACACTTGG 
CAGCAAATTAAAGCCCAAGCCCTTGGGCGACCATTTGAGGTGGACAAACTAACAACGATT 
TTGGCGGAGCCAGAACTCAGCCGCTGGCGATCGCGGGCACAGGGCTTAAAGTCCGAGGGC 
AGCTATTGGGTTTATACCCTAAAGAACTTAGAAGTGAAGGAAGTCCGCCTCCAAAGGAGC 
GATCGTGTGGAGGTGTTGGCAGAAGTCAACGAGGATGCCCGTTTCTATGAACAGGGAACC 
CTGCGCACTGATATTTCCTATAGCGATCCCTACCGGGTCATTTATACCTTTATCCGTCGC 
GGCAATCAATGGTTGATTCAAGGCATGCAGGTGGTTAGTTAA 



Protein sequence: 
>tlr0758 (SEQ ID NO: 192) : 

MRIPLDYYQVLGVPIQATPEQIEQAFRDRLLQLPTHQHSPTTVATRRELIEQAYAVLREPEQRDAYDRHCRTVDP 
DDLIAQLDPDATTPHIEISDEQLSGALLIiLYELGNYAQVVNLGDAFLKKDVFERNRPYTSPAAVADITLTVALAY 
LELGREEWQRQSYESAASQLEAGLQVLQRVNLFPELQEQFQTELNRLRPYRILELLALPLSDSANRQRGILLLRQ 
MLSERGGIEGRGDDRSGLTVEDFLKFILQLRSHLTVAEQQELFERESRRPSAVATYLAVHALVARGVHELQPSYI 
CRAKDLLQQLLPHQDVYLELASCLLLLGQPTEALAALDHSQDQPTLDFIRRHAGEAGDRLPGLYYYTTQWLTEEI 
YPAFRDLGETPVALEAYFADANVQTYLEALSEDSIAPEPPATTASALPEVIRPTVAVPPPLSFTAETLPLQDQSR 
LGQGLSASAFTPSATATGTSMPQPSPRKRRSPRNRCAQKRQTWFWMGAGVVLVGLGALAKVYWPAKTAEAPPPPV 
TPAPTPVATPTPTPQPTTLAITLTPEMARDRLHTWQQIKAQALGRPFEVDKLTTILAEPELSRWRSRAQGLKSEG 
SYWVYTLKNLEVKEVRLQRSDRVEVLAEVNEDARFYEQGTIiRTDISYSDPYRVIYTFIRRGNQWLIQGMQWS 
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Tricbodeamivm erythraeum 

Contig97 Gene 8639 
(SEQ ID NO: 193) : 

GTGCGGATTCCATTAGATTATTATCGAATTTTAGGTTTACCAATTCAGGCTACTGCTGAACAGTTGCGGCAGGCA * 

CATCA 

AGACCGCACTCAGCAGTTTCCTAGAAGGGAGTATTCTGAAGCCACAATAGTTGCTCGTAAACAGCTTATAGATGA 
GGCTT 

ATGCTGTTCTTTGCGATCCTGAACAACGTCAAACCTATGATGGTAACTTTTTAGCTAAAACCTACGAGCCAATAG 
TAGAA 

GAACTCAATCCAAGTTCTCAGATAAATTTTGATCAAGCACAAGAAAAAGAAACCACACTTAAGGAGACTAGAGAA 
GTTCT 

TCCGGAAATAGCTTCTAAACAGTTAAAAAAAAGGACAAGTTATCAAAACAGAGAGACTAAAGCTGCCTCTGATTT 
TCATT 

CTAATACCCCTAGTATAGAAATAGAATATCCACAATTTGTGGGAGCCATCCTAATTTTACATGAGCTAGGAGAAT 
ATGAG 

CTAGTATTAAAAATAACTCACCCTTATCTTCTTAACAATAGTATAACTATTAAAGATGGACGTTTTGGAGACCCA 
GCATT 

AGTTTTGCCAGATGTTGTCCTTACAGTTGCTCTAGCAAATTTAGAATTGGGCAGAGAGGAATGGCAACAAGGACA 
ATACG 

AAAGTGCAGCTACAGCTTTAGAGGCTGGCCTAGGGTTATTGCTACGAGAAAACCTATTTGTCCAAATACGAGGAG 

AGATA 

CAAGCTGACCTTTATAAGCTACGTCCTTATAGAATAATGGAGCTAATAGCACTACCAGAGGAAATAGCTCTAGAC 
CGTAG 

CCGTGGACTAGAAATTCTTCAAGATATGCTCAATGAACGGGGAGGAATTGATGGTCAAGGTGAAGATAGCTCTGG 
ACTTG 

GGATAGAAGATTTTCTAAAGTTTGTTCAGCAGCTACGTCAATACTTAACTACAGCAGAGCAAAAGAAGTTATTTG 
AGGCA 

GAAGCCCTTCGCCCTTCCGCAGTTGGTGCATATCTAGCGGTTTATACTTTTTTAGCTCAAGGGTTTGCTCAAAAA 

CAACC 

AGCCTTTATTCGTAAAGCTAAGTTGATGTTAATGCAATTGGGTCGGAGTCAAGATGTAAATTTAGAGAAATCTGT 
CTGTG 

CTTTACTTTTAGGGCAAACTGAAGAAGCTAGTCGTTCATTAGAACTTAGCCATGAAAATGAACCTCTATCCTTTA 
TTAAA 

GAAAATTCTCAACAATCTCCAGATTTATTGCCAGGTCTATGTCTCTATGCTGAACATTGGTTGACAGAGGAGGTT 
TTTCC 

ACATTTCCGTGATTTGTCTGACAAGTCAGCTTCTTTGAAAGATTATTTTGCAGATGAACATGTTCAAGCTTATCT 
AGAAG 

CTTTACCTACAGAAGCAGAGGTAGCTAATCAATGGGTAGTCGTTCAGCCTCGTCGTAGTAATCACAATAAAAAAC 
AAATG 

TTCGACCCCAAGGAACTTGAGAAGTTGAATGTATCAGATTTGGAGGATAAAGATATTTCTCGGGTAGATGCTACT 
GCTAC 

TGGTATTGTTGCTTCTGGAAGTCAAGGAAGTTCTAATTTACTAGGGGCTAGTTCTGATGGGTTGCTTCAAGAATT 
AGAAA 

AATCATCATCTACTAGAGGTGGGCCAAAACAAGTAACTACTAAGAGTTCTAGTCACTATTTAGGAAAAATTAGGG 

AAAAG 

AGTATAAGTGGTTTACCTGAGTTTAATGAAAGTACATCTATTGAGAGTGGGGGGTTACCCCAATCTATCCAAGAG 
CATAG 

TTCACGTAGAACTTCTGCTAGAAGAGAACCTGTTAAGTTTGGTCGTTTAATATTAATCGCAATTGTGGGATTTTT 

GTTAA 

TAGGATTTATTGGGTTGTTAACAATTAAAACTATCGGCTGGTTAGTAAATGCTTTAGGATGGGAAAGAGAATUUVC 
TGATG 

ATACAATTGGATAGGCCTCCTATAGAAATCCCAGAACCTGATCGGGTTAACCTCGCAGCATCAGGACCGATAACA 
AAAGA 

AGTAGCAAGGCGAACAATTCAAAGTTGGTTAGATATCAAGGCTTCTGCTCTTGGTCCTAATCATAAAATTGAACA 
ATTAC 
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CAAATATTTTAGTAGAACCGGCACTTTCTCGTTGGTTACCTACAGCTAATGCCCTGAAGCAAGAAAAGTCATACC 
GTAGG 

TATGAGCATGATTTAGAAATAAGTAATATAAAGATGAGTAATACAAATTCTAATCTCGCTCAAGTAGATGCTAAA 
GTGAT 

AGAAAAGGTAGAGTTTTATTCTGACAATGGTAGATTAACTAATACTAACAATGAAAACTTATTTGTTCGTTATGA 
TTTAG 

TTCGTAAAAGTCAAAAATGGCAAATTAGTAATTGGAAGGTATTGAGATAA 



PROTEIN (SEQ ID NO: 194) : 

VRIPLDYYRILGLPIQATAEQLRQAHQDRTQQFPRREYSEATIVARKQLIDEAYAVLCDPEQRQTYDGNFLAKTY 
EPIVE 

ELNPSSQINFDQAQEKETTLKETREVLPEIASKQLKKRTSYQNRETKAASDFHSNTPSIEIEYPQFVGAILILHE 
LGEYE 

LVLKITHPYLLNNSITIKDGRFGDPALVLPDVVLTVALANLELGRE 
IRGEI 

QADLYKLRPYRIMELIALPEEIALDRSRGLEILQDMLNERGGIDGQGEDSSGLGIEDFLKFVQQLRQYLTTAEQK 
KLFEA 

EALRPSAVGAYLAVYTFLAQGFAQKQPAFIRKAKLMLMQLGRSQDVNLEKSVCALLLGQTEEASRSLEIjSHENEP 
LSFIK 

ENSQQSPDLLPGLCLYAEHWLTEEVFPHFRDLSDKSASLKDYFADQHVQAYLEALPTEAEVANQWVWQPRRSNH 
NKKQM 

FDPKELEKLNVSDLEDKDISRVDATATGIVASGSQGSSNLLGASSDGLLQELEKSSSTRGGPKQVTTKSSSHYLG 

KIREK 

SISGLPEFNESTSIESGGLPQSIQEHSSRRTSARREPVKFGRLILIAIVGFLLIGFIGLLTIKTIGWIiVNALGWE 
REKLM 

IQLDRPPIEIPEPDRVNLAASGPITKEVARRTIQSWLDIKASALGPNHKIEQLPNILVEPALSRWLPTANALKQE 
KSYRR 

YEHDLEISNIKMSNTNSNLAQVDAKVIEKVEFYSDNGRLTNTNNENLFVRYDLVRKSQKWQISNWK^ 



Fig. 9 
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56041 actgtaaatt ttgataaata aaaaaaaaca aaaaaaagat cgccaaatca tatttcatac 
56101 tatcagattt aaacaatata atttgttcga cgatacagaa atattttacc tcacaggaag 
56161 aggttgcgca gaaggagcca tggatgtgtt tgttcgagtc gagttgcttt gttgtaagta 
56221 ggtaattgca agaaacttga gttgtctata aagctttgga atacttctct ttatatatac 
56281 gtttacaaca attttttttt tttttttttt tctattttta caacaaattg ttttttatta 
56341 taataataaa cttaaacgaa aataaataat atctctttgt tctatttctt aaaaaagaaa 
56401 ttagcttgta gtacttcaac gtatcttaac tctttagtct ttagtaggta tatatcatct 
56461 atttatttat ttttattttt tttatattac gattatagtg tacgtacgta tttattaatc 
56521 aaaaataact tggtagaagt aaaaagaaaa tgattttttt tttactcagt gatcagtttt 
56581 acgtttattc aaaaataagt tgtagtttcc ttcttaatat tcaagttata tgactaaaaa 
56641 ttggtcggtt aatttactat taagattaat cggaaactct agttagatca cgagataatc 
56701 atcacgtgga gaaacatttg gttcttgtca cgtggagaaa acgttaagct tattttttac 
56761 ttctttatta tatttttgag gaaatggttg aaagaaagag agtgtttaaa atgtgaatgc 
56821 gctcgtagtt aggtggaggt taatgggtag gagggtaggt catatgtgta ttagtgatgg 
56881 ataaaaatta aaaacataaa aaaaacttca agctgtaaat aatctaataa aagaacatag 
56941 aaatataatc aaagaaccat ttaactaaat aaatactttc gattcaaata gcatatttct 
57001 aagttccaag aatagctatc ctctatccac atgttacatt ttttttttct ttttcacatc 
57061 catatagttt ttaaaataat tttctagatg gtatttttta ttcgacattt ttttttcctt 
57121 ttagatttac tgattataat ttatttagaa ataaatgata cgactgtcgt ttctacaaaa 
57181 ctgaaatttg caaacattgg accaaaaagc gaaaccttaa tcacttgaaa cgacaacgtt 
57241 ctttagtatg tttttggaca tacaaagtac acataagatg ttccctcact cttcgattgt 
57301 ttcttaacct aatataatta agcaatattg aacttgagtc actcaatgct gcaccgaagg 
57361 agcctttaga ttttgagcaa attcatgaga gtttagcttc tcattcatca ctctgaattt 
57421 ctcttttatc ctctttatct gtccaaaaca tgacacataa cataatgtta gttctcctgc 
57481 atacttccaa tggcaaatag aaaaaagaga cattgatcat agaagtcagt ttggtttacc 
57541 cttctgagct cgatctctgt gctccgtttc ttttgatcaa gtgattgccg gagattcgtg 
57601 atgtcgaaga tactatcgag gtcgtcttca aatgcgtttt ccaactcttc ccggagaaga 
57661 gcaggtaact tatcaacgat gggcattaga agaaaacagt tgaactgcag aacaaaagaa 
57721 aacacagata caaacttttt aaaagaaaag tcattttaaa agcaagaaga atctgagtaa 
57781 aaactgaagt aggagcaaac ctttaactca gcagaggcga gaaagtactc tcgtatgccc 
57841 tggaatatct gttggaccaa tgcgtacaca attctctcag aggaaggagc aagcttgcgg 
57901 ttccaaagtg tgctatctag aagatcagcc aaccgcattt ctgttgtctg aatactggaa 
57961 cctgaatcga tgtttgaggc gagatggctt agctttacat ctgatcttga cttggtgtct 
5 8021 gttgtgccac ctaatgcatc ttggggaaga ctaaatccta tggcattacc tgatgtcgta 
58081 ttatgctctg ttccaccaaa tgagtccaag aattgacgta gaccagctcg gttctacata 
58141 acattgagaa acgaaaacta ctcaatcaga aacggatact tgatggtatg tacacaactc 
58201 aattggattg aaacagagct atagggctgt agcaatgacc ttgttgtgaa gagaccatgt 
58261 aacatagcga gttgtacttg ctaaatcctc catacatctg caaacaatat aaaatccaaa 
58321 gggtgatcaa tcactaaagc tcactagaac acaggtagga ggcaccgaca tggtaagaac 
58381 aggaattgga aatagaatta cttgtcacga catgattttt ctgtggactc cacaaaactg 
58441 ttgaatgctg aagcaacccg cttgagaaac acctcatgcc cacttaaata ttcaccttct 
58501 ttctattcaa atttagaaca tacatcaaaa aatttgctgg aaagggatca tgagtatgat 
58561 accgtcaaac caaagaaaac agtacctacc tgaagaagat atacagaaat tggaagcaat 
58621 ctcttgagaa tgtgtagaag cctcgcccct aactatatca acgcaaaaca aacgaaaatg 
58681 agaactggaa aaaactttct gtatggaaag agaaacatgt gaataacaaa atttcagatg 
58741 aaagtattcc caaacatagt ttctgtaagc agaacatgtt tactcgataa ctcttatgca 
58801 caaataagtt ccagcaaatc tcaaaactga atggtagtat gatttcaata tataacgtta 
58861 tatttcattt ttttttttac gtacagtaca ccttaactaa ttagtaaaat tgctttccat 
58921 cctccacgaa agaaaaagaa aaaagtagct atatctatgt cacctgatga aggaaaggtt 
58981 caaacgtctc acgagccttc gcaactgcta taacacaagc tgttctacaa cagcaaataa 
59041 gagaaagaga ataagaggcc atagaaaaca tgacaaacgt tgcagctcag attagatact 
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59101 gaaaggggtc tgggatgcaa agacaataaa ttgagaagtg tgttgcatgt cagtcaatcc 
59161 tatgatacct ggaatagttt gttccatcat gaatatcctc aactccacat gcatttacaa 
59221 tttcctccct cgttattggg ggacatttga tagcaccaac tagaaaacga aactcagcca 
59281 tggcacggtg atattgtgca cccccataga gacgcatccc tgcattctgt aaaatgaaag 
59341 ataatctggt tatggtctct cataattctt gaaggtccaa cgaagtatct cttttatttg 
59401 tttccaatac attattcttt ggcacatatg tttcatgcgg tcaaatttat cttccatcat 
59461 attataatcc atgtacaaga acaagacaac tggatttgaa gaccatgccc agcttgctct 
59521 ataaagtcca acaatattct gcttcaggga aagacttacc ggtattagct tatgtgaaaa 
59581 ctggagacca tcagtaccaa caaatgctcc tccttgtgtc ctttcatctt gcagtgtctc 
59641 acctgaaaaa caccatgaga aattattaac aatcaaagaa cccaacataa agagaatgct 
59701 gttataaaat gtgcttctgc cagtaaccaa agtatcatga ccaatgattg attgattagc 
59761 atacatcatt ccatgtgtaa tcatcgcagt ctggtgaccc agtcgaattg aacaatatgc 
59821 atttaactaa actgattttg caaaagtcca atttaacaac acccagaaac aagaaaagtt 
59881 tatgccaaag aagttgacta gcagagaaca gagcagtaac attaccaaat ttatctggag 
59941 gggccacaac tgttcccttc aataacagcg ataactgatc aagaaaaata taaacaaaac 
60001 aggtgagaaa acacagcact gatcaatact aacaaaggta cttcgtacgt caatcagaaa 
60061 atatgacgca gcaattttaa agtcttaagg gcatccaaca caaaaagttt acagccattc 
60121 tgaatttgta gcaagtccta gatatcattt actgtagcat aattttatat gtgtcagtaa 
60181 tcaataaaca aatttgtttt tatgtgtcag tagttaataa accaaaaaaa aagagaagtt 
60241 tacacaaatg aacttgttgt aattatacaa aaactattaa tccacgagtc caggcaaaaa 
60301 tgaaaaggta tgggaaggtg taaatagaaa tctaaaaaaa cgaaatgctc tctacagtta 
60361 ccttggttaa gaagagatca tggaaagtcc tgcctctctc tttgagtttt gcttcatcca 
60421 aagagctgca ttgaaaggaa ttattcaacc tccaatgagt tatattttct ataaatcagt 
60481 agctaacaat taaactgcct aaaatcaagt agacattttc agacaaaaca aattgcgacc 
60541 taagttcctt gctcacggta tccagctttc tgactgtact gcggtactcc tttcctaaca 
60601 gtggaatgat caatggaaca ctctctttgt acctggaaag agaagggcat caagactaca 
60661 gcgaaaagta aactacaata gaaacagagg ctggaaaaat cagagttaaa acaacagtta 
60721 taccttttcc agagtagttc ttccagaaac aacctcagtt tactgatgcc aatcctactc 
60781 ttttcctgtt ttgtcagtaa acggcccaac ttcttctcta aagatgcaat gtcttccatt 
60841 tctctaagtg acacagcctg taataaaaac cacacatagt ttagaaaaag acctgtttaa 
60901 cttgtttaag gaatcagaca gcagagcaga gacctgtttg aactcgtcat tagacttata 
60961 cactgaatcc tgtccatagc caactcttcc agaaggcaca gacgtgaaaa aaggagaatc 
61021 gcccaataag gagctgtcaa gtgcgcttgc aggaggtgag agaaagactt ccacgtcaga 
61081 tgaacatgag aattgaggga ttttagtgtc aagctttgta gaaacaacaa ttgtcctaga 
61141 aagctcagga tcaacctaca tgaacgagaa acaaactt.ta acaaaaataa ,agacaaggtt 
61201 agacgcaatg gagttacgtc aagcaacgta cttgcatcac tatccttcga gtggttgcaa 
61261 tgctccagtc actgctatct tcgaggcata aaatgatgaa ctctttgtgt tgcatctttg 
61321 ctcggactag agcttccaca gcccgtgctt gaacctaaga aaaagaacaa gtaacccact 
61381 ctcaaataaa gcaaaaccaa aacatgaaat cagccacgga attggctgga agccataaga 
61441 aaaaacaacc tgaagagctc ggtttttcag tcctggtgca ggagcaataa gtccaggtgt 
61501 atcaatgatg gtaaggtttg gacaatactt atactggact ttcacaataa tctcctttgc 
61561 agagaatggg ctacatggct cttgctccag cctcatgttc tcagcctcaa tatatgccta 
61621 actccaaatc atataacaaa tttcgttaac atgagcattt cgcttctcta caataaacct 
61681 aagtacttgt gtttctcaac attcgtcaaa atcttcccag aatttatacg cagaaacaag 
61741 caattgaaga agcacaagta ataataataa caaaacacct gaatttgtga gagagatttg 
61801 ggaagagaaa cggaaggatc atcatcagat ccgagatgac aaagcgggaa ttgacactga 
61861 ggatcgtact tcatatggag agtaatcggc cgacgagtct tggttccgcc gccgacatgg 
61921 ttaaattgaa accccataag agcttccaca agcgcacttt taccgtcggt ctgctgtccc 
61981 accacaagaa ccgccggtgc ttcgaacggc gtctccaatt cctgcgccaa agcgtgtaac 
62041 tcgttgtaag cttcgtaaag actccaccgc tcctcaatcg cagcgtcgtc ctcttccgcc 
62101 atttcctcaa ccgtcaccga ttttgctgat acttccgcca tcgtctctta cgaaaatgag 
62161 caagaggaag agtaagagta agagagtgtc tcttatttct tctactcttt agttttcgtc 
62221 gccgttcctt tttccgccat ggaattagca gatacggcta atttcaattt ttgtcaaaag 
62281 aaatattttt tgtgttttaa tctcacgcgc atccatggcg cgttgagtca acgttgtaat 
62341 agttctccgc taaatttaaa taaaagagcg cgtaaggaga gagtttaagg attttttttt 
62401 tttggtcggc aaatacaaag gatttgcttt gtcttgacca atagtatatg cagaaatatt 
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62461 atctcaaagg atttgtgata actatgtagt acagaattgt. gattattgga tgagaaacca 
62521 gaaatatttt gagcaaatga cgacttgtta atttactatt ttttcatttc ttaaaggtct 
625 81 ctcttgtgta actatgatta aaattgaaat agtgactttt attgttacga catggaacaa 
62641 atcaacgagt tctattgtta aagagagaca ttgatgaatg taacaaaact gtggcttaga 
62701 agccgaaagg agacttagtt cgggtccctc cttcaccgta ttgctcgttc cattttctca 
62761 attcgttcat tgtcgtcgcg tcgtatgcca ctgacggact tacctgcaaa ttacattaca 
62821 atgacgcaat ttcgataatg caaacaccag gggaaaaaac atgaatagag atgatgatga 
62881 tgttttttaa gagattgatc aataccttag ctttggattg aatgaagtcg tccaaactca 
_62941 gtggtcgtag atcaggggac gcatttgtta ccgagtcctg ataattcgac gtttcaaaag 
63001 catggagtga gtacaaaaat tatttttcgt aacaacagaa atcaactgtg tgggtttatg 
63061 catgtcctta ccttgttttc ttcttgtaac aattcttgaa caggtctgta tgcagctgct 
63121 atgcatagat tctgcaatgt aagaaaagaa aaggaatcag aactactgtg ttgaatcata 
63181 ctcgaacttg taaatgaaac cccgaatgac caaaccttta gatcgcttcc tgaatatcct 
63241 tcggtttcct ttgcaagttt atcaaactcg aaaccagttt caagattttc tggtgtcaga 
63301 aatatcttca atatcttcaa ccggttttcc gcatctggta aatccacata tatcctataa 
63361 acacaagcct caatacaatt atcgaaaaga tacaaatatt ccaaaggaga aattacttga 
63421 aagcttaaat taccgtcttg gtagcctacg aatgacagcg tcatcaagat caaaaggtcg 
63481 gttggtggca ccgagaatga gaatcctttg gctatctttt gatctgagtc catcccaagc 
63541 tgccataaac tcatttctca ttcttcgtgt tgcctcgtgc tcaaaagcac caccacgagc 
63601 acccaacaaa ctgtcaacct atacgacaac aaaataaatt acagttagtc cttgagtaac 
63661 acattttacg catcacaaaa gtattcctca taaaaagcaa taaccgaaat tgaaaagtga 
63721 tataaagcta aacaatttct cacctcatca acaaatataa tgacgggggc tagtttgctt 
63781 gcaaaagaga acaaagcctt cgtgagcttc tctgcatctc caaaccactg tgccaaacaa 
63841 tggacgaaat tgacttaaat cagaaccaat cagaggtaaa gttggaaaga gatttactct 
63901 aagttacaat cggcattgac aataataagt cgatgaccgg ggtggaaaag tttttcttat 
63961 gtcattagat attctcctta tttatatgaa gatgtttaca aagtggaata tcaacgtgac 
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SEQ ID NO:12 



gaaa ttagccgtat ctgctaattc catggcggaa aaaggaacgg 
cgacgaaaac taaagagtag aagaaataag agacactctc ttactcttac 
tcttcctctt gctcattttc gtaagagacg 

ATGGCGGAAGTATCAGCAAAATCGGTGACGGTTGAGGAAATGGCGGAAGA 

GGACGACGCTGCGATTGAGGAGCGGTGGAGTCTTTACGAAGCTTACAACG 

AGTTACACGCTTTGGCGCAGGAATTGGAGACGCCGTTCGAAGCACCGGCG 

GTTCTTGTGGTGGGACAGCAGACCGACGGTAAAAGTGCGCTTGTGGAAGC 

TCTTATGGGGTTTCAATTTAACCATGTCGGCGGCGGAACCAAGACTCGTC 

GGCCGATTACTCTCCATATGAAGTACGATCCTCAgTGTCAATTCCCGCTT 

tGTCATCTGGGATCTGATGATGATCCTTCCGTTTCTCTTCCCAAATCTCT 

CTCACAAATTCACGCATATATTGAGGCTGAgAACATGAGGCTGGAGCAAG 

AGCCATGTaGCCCATTCTCTGCAAAGGAGATTATTGTGAAAGTCCAGTAT 

AAGTATTGTCCAAACCTTACCATCATTGATACACCTGGACTTATTGCTCC 

TGCACCAGGACTGAAAAACCGAGCTCTTCAGGTTCAAGCACGGGCTGTGG 

AAGCTCTAGTCCGAGCAAAGATGCAACACAAAGAGTTCATCATTTTATGC 

CTCGAAGATAGCAGTGACTGGAGCATTGCAACCACTCGAAGGATAGTGAT 

GCAAGTTGATCCTGAGCTTTCTAGGACAATTGTTGTTTCTACAAAGCTTG 

ACACTAAAATCCCTCAATTCTCATGTTCATCTGACGTGGAAGTCTTTCTC 

TCACCTCCTGCAAGCGCACTTGACAGCTCCTTATTGGGCGATTCTCCTTT 

TTTCACGTCTGTGCCTTCTGGAAGAGTTGGCTATGGACAGGATTCAGTGT 

ATAAGTCTAATGACGAGTTCAAACAGGCTGTGTCACTTAGAGAAATGGAA 

GACATTGCATCTTTAGAGAAGAAGTTGGGCCGTTTACTGACAAAACAGGA 

AAAGAGTAGGATTGGCATCAGTAT^CTGAGGTTGTTTCTGGAAGAACTAC 

TCTGGAAAAGGTACAAAGAGAGTGTTCCATTGATCATTCCACTGTTAGGA 

AAGGAGTACCGCAGTACAGTCAGAAAGCTGGATACCGTGAGCAAGGAACT 

TAGCTCTTTGGATGAAGCAAAACTCAAAGAGAGAGGCAGGACTTTCCATG 

ATCTCTTCTTAACCAAGTTATCGCTGTTATTGAAGGGAACAGTTGTGGCC 

CCTCCAGATAAATTTGGTGAGACACTGCAAGATGAAAGGACACAAGGAGG 

AGCATTTGTTGGTACTGATGGTCTCCAGTTTTCACATAAGCTAATACaGA 

ATGCAGGGATGCGTCTCTATGGGGGTGCACAATATCACCGTGCCATGGC 

TGAGTTTCGTTTTCTAGTTGGTGCTATCAAATGTCCCCCAATAACGAGGG 

AGGAAATTGTAAATGCATGTGGAGTTGAGGATATTCATGATGGAACAAA 

CTATTCCAGAACAGCTTGTGTTATAGCAGTTGCGAAGGcTCGTgAGACGT 

TTGAACcTTTCCTTCATCAGTTAGGGGCGAGGCTTCTACACATTCTCAAG 

AGATTGcTTCCAATTTCTGTATATCTTCTTCAGAAAGAAGGTGAATATTT 

AAGTGGGCATGAGGTGTTTCTCAAGCGGGTTGCTTCAGCATTCAACAGTT 

TTGTGGAGTCCACAGAAAAATCATGTCGTGACAAATGTATGGAGGATTTA 

GCAAGTACAACTCGCTATGTTACATGGTCTCTTCACAACAAGAACCGAGC 

TGGTCTACGTCAATTCTTGGAcTCATTTGGTGGAACAGAGCATAATACG 

ACATCAGGTAATGCCATAgGATTTAGTCTTCCCCAAGATGCATTAGGTGG 
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cacaacagacaccaagtcaagatcagatgtaaagctaagccat 

ctcgcctcaaacatcgattcaggttccagtattcagacaacagaaatgcg 

gttggctgatcttctagatagcacactttggaaccgcaagcttgctcctt 

cctctgagagaattgtgtacgcattggtccaacagatattccagggcata 

cgagagtactttctcgcctctgctgagttaaagttcaactgttttcttct 

aatgcccatcgttgataagttacctgctcttctccgggaagagttggaaa 

acgcatt_tgaagacgacctcgatagtatcttcgacatcacgaatctccgg 

caatcacttgatcaaaagaaacggagcacagagatcgagctcagaagggt 

aaagaggataaaagagaaattcagagtgatgaatgagaagctaaactctc 

atgaatttgctcaaaatctaaaggctccttcggtgcagcatHI 

gtgact 

caagttcaatattgcttaattatattaggttaagaaacaatcgaagagtg 
agggaacatcttatgtgtactttgtatgtccaaaaacatactaaagaacg 
ttgtcgtttcaagtgattaaggtttcgctttttggtccaatgtttgcaaa 
tttcagttttgtagaaacgacagtcgtatcatttatttctaaataaatta 
taatcagtaaatct 
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SEQ ID NO:13 



MAEVSAKSVTVEEMAEEDDAAIEERWSLYEAYNELHALAQELETPFEAPAVLWGQQTDGKSALVEALMG 

FQFNHVGGGTKTRRPITLHMKYDPQCQFPLCHLGSDDDPSVSLPKSLSQIHAYIEAENMRLEQEPCSPFS 

AKEIIVKVQYKYCPNLTI IDTPGLIAPAPGLKNRALQVQARAVEALVRAKMQHKEFI ILCLEDSSDWS lA 

TTRRiVMQVDPELSRTI-WSTKLDTKIPQFSCSSDVEVF.LSPPASALDSSLLGDSPFFTSVPSGRVG^ 

DSVYKSNDEFKQAVSLREMEDIASLEKKLGRLLTKQEKSRIGISKLRLFLEELLWKRYKESVPLIIPLLG 

KEYRSTWKLDTVSKELSSLDEAKLKERGRTFHDLFLTKLSLLLKGTVVAPPDKFGETLQDERTQGGAFV 

GTDGLQFSHKLIQNAGMRLYGGAQYHRAMAEFRFLVGAIKCPPITREEIVNACGVEDIHDGTNYSRTACV 

lAVAKARETFEPFLHQLGARLLHILKRLLPISVYLLQKEGEYLSGHEVFLKRVASAFNSFVESTEKSCRD . 

KCMEDLASTTRYVTWSLHNKNRAGLRQFLDSFGGTEHNTTSGNAIGFSLPQDALGGTTDTKSRSDVKLSH 

LASNIDSGSSIQTTEMRLADLLDSTLWNRKLAPSSERIVYALVQQIFQGIREYFLASAELKFNCFLLMPI 

VDKLPALLREELENAFEDDLDSIFDITNLRQSLDQKKRSTEIELRRVKRIKEKFRVMNEKLNSHEFAQNL 

KAPSVQH 
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SEQ ID NO:14 

56041 actgtaaatt ttgataaata aaaaaaaaca aaaaaaagat cgccaaatca tatttcatac 
56101 tatcagattt aaacaatata atttgttcga cgatacagaa atattttacc tcacaggaag 
56161 aggttgcgca gaaggagcca tggatgtgtt tgttcgagtc gagttgcttt gttgtaagta 
56221 ggtaattgca agaaacttga gttgtctata aagctttgga atacttctct ttatatatac 
56281 gtttacaaca attttttttt tttttttttt tctattttta caacaaattg ttttttatta 
56341 taataataaa cttaaacgaa aataaataat atctctttgt tctatttctt aaaaaagaaa 
56401 ttagcttgta gtacttcaac gtatcttaac tctttagtct ttagtaggta tatatcatct 
56461 atttatttat ttttattttt tttatattac gattatagtg tacgtacgta tttattaatc 
56521 aaaaataact tggtagaagt aaaaagaaaa tgattttttt tttactcagt gatcagtttt 
56581 acgtttattc aaaaataagt tgtagtttcc ttcttaatat tcaagttata tgactaaaaa 
56641 ttggtcggtt aatttactat taagattaat cggaaactct agttagatca cgagataatc 
56701 atcacgtgga gaaacatttg gttcttgtca cgtggagaaa acgttaagct tattttttac 
56761 ttctttatta tatttttgag gaaatggttg aaagaaagag agtgtttaaa atgtgaatgc 
56821 gctcgtagjtt aggtggaggt taatgggtag gagggtaggt catatgtgta ttagtgatgg 
56881 ataaaaatta aaaacataaa aaaaacttca agctgtaaat aatctaataa aagaacatag 
56941 aaatataatc aaagaaccat ttaactaaat aaatactttc gattcaaata gcatatttct 
57001 aagttccaag aatagctatc ctctatccac atgttacatt ttttttttct ttttcacatc 
57061 catatagttt ttaaaataat tttctagatg gtatttttta ttcgacattt ttttttcctt 
57121 ttagatttac tgattataat ttatttagaa ataaatgata cgactgtcgt ttctacaaaa 
57181 ctgaaatttg caaacattgg accaaaaagc gaaaccttaa tcacttgaaa cgacaacgtt 
57241 ctttagtatg tttttggaca tacaaagtac acataagatg ttccctcact cttcgattgt 
573 01 ttcttaacct aatataatta agcaatattg aacttgagtc actcaatgct gcaccgaagg 
57361 agcctttaga ttttgagcaa attcatgaga gtttagcttc tcattcatca ctctgaattt 
57421 ctcttttatc ctctttatct gtccaaaaca tgacacataa cataatgtta gttctcctgc 
57481 atacttccaa tggcaaatag aaaaaagaga cattgatcat agaagtcagt ttggtttacc 
57541 cttctgagct cgatctctgt gctccgtttc ttttgatcaa gtgattgccg gagattcgtg 
57601 atgtcgaaga tactatcgag gtcgtcttca aatgcgtttt ccaactcttc ccggagaaga 
57661 gcaggtaact tatcaacgat gggcattaga agaaaacagt tgaactgcag aacaaaagaa 
57721 aacacagata caaacttttt aaaagaaaag tcattttaaa agcaagaaga atctgagtaa 
57781 aaactgaagt aggagcaaac ctttaactca gcagaggcga gaaagtactc tcgtatgccc 
57841 tggaatatct gttggaccaa tgcgtacaca attctctcag aggaaggagc aagcttgcgg 
57901 ttccaaagtg tgctatctag aagatcagcc aaccgcattt ctgttgtctg aatactggaa 
57961 cctgaatcga tgtttgaggc gagatggctt agctttacat ctgatcttga cttggtgtct 
58021 gttgtgccac ctaatgcatc ttggggaaga ctaaatccta tggcattacc tgatgtcgta 
58081 ttatgctctg ttccaccaaa tgagtccaag aattgacgta gaccagctcg gttctacata 
5 8141 acattgagaa acgaaaacta ctcaatcaga aacggatact tgatggtatg tacacaactc 
58201 aattggattg aaacagagct atagggctgt agcaatgacc ttgttgtgaa gagaccatgt 
58261 aacatagcga gttgtacttg ctaaatcctc catacatctg caaacaatat aaaatccaaa 
58321 gggtgatcaa tcactaaagc tcactagaac acaggtagga ggcaccgaca tggtaagaac 
58381 aggaattgga aatagaatta cttgtcacga catgattttt ctgtggactc cacaaaactg 
58441 ttgaatgctg aagcaacccg cttgagaaac acctcatgcc cacttaaata ttcaccttct 
58501 ttctattcaa atttagaaca tacatcaaaa aatttgctgg aaagggatca tgagtatgat 
5 8561 accgtcaaac caaagaaaac agtacctacc tgaagaagat atacagaaat tggaagcaat 
58621 ctcttgagaa tgtgtagaag cctcgcccct aactatatca acgcaaaaca aacgaaaatg 
58681 agaactggaa aaaactttct gtatggaaag agaaacatgt gaataacaaa atttcagatg 
58741 aaagtattcc caaacatagt ttctgtaagc agaacatgtt tactcgataa ctcttatgca 
58801 caaataagtt ccagcaaatc tcaaaactga atggtagtat gatttcaata tataacgtta 
58861 tatttcattt ttttttttac gtacagtaca ccttaactaa ttagtaaaat tgctttccat 
58921 cctccacgaa agaaaaagaa aaaagtagct atatctatgt cacctgatga aggaaaggtt 
58981 caaacgtctc acgagccttc gcaactgcta taacacaagc tgttctacaa cagcaaataa 
59041 gagaaagaga ataagaggcc atagaaaaca tgacaaacgt tgcagctcag attagatact 
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59101 gaaaggggtc tgggatgcaa agacaataaa ttgagaagtg tgttgcatgt cagtcaatcc 
59161 tatgatacct ggaatagttt gttccatcat gaatatcctc aactccacat gcatttacaa 
59221 tttcctccct cgttattggg ggacatttga tagcaccaac tagaaaacga aactcagcca 
59281 tggcacggtg atattgtgca cccccataga gacgcatccc tgcattctgt aaaatgaaag 
59341 ataatctggt tatggtctct cataattctt gaaggtccaa cgaagtatct cttttatttg 
59401 tttccaatac attattcttt ggcacatatg tttcatgcgg tcaaatttat cttccatcat 
59461 attataatcc atgtacaaga acaagacaac tggatttgaa gaccatgccc agcttgctct 
59521 ataaagtcca acaatattct gcttcaggga aagacttacc ggtattagct tatgtgaaaa 
59581 ctggagacca tcagtaccaa caaatgctcc tccttgtgtc ctttcatctt gcagtgtctc 
59641 acctgaaaaa caccatgaga aattattaac aatcaaagaa cccaacataa agagaatgct 
59701 gttataaaat gtgcttctgc cagtaaccaa agtatcatga ccaatgattg attgattagc 
59761 atacatcatt ccatgtgtaa tcatcgcagt ctggtgaccc agtcgaattg aacaatatgc 
59821 atttaactaa actgattttg caaaagtcca atttaacaac acccagaaac aagaaaagt^^ 
59881 tatgccaaag aagttgacta gcagagaaca gagcagtaac attaccaaat ttatctggag 
59941 gggccacaac tgttcccttc aataacagcg ataactgatc aagaaaaata taaacaaaac 
60001 aggtgagaaa acacagcact gatcaatact aacaaaggta cttcgtacgt caatcagaaa 
60061 atatgacgca gcaattttaa agtcttaagg gcatccaaca caaaaagttt acagccattc 
60121 tgaatttgta gcaagtccta gatatcattt actgtagcat aattttatat gtgtcagtaa 
60181 tcaataaaca aatttgtttt tatgtgtcag tagttaataa accaaaaaaa aagagaagtt 
60241 tacacaaatg aacttgttgt aattatacaa aaactattaa tccacgagtc caggcaaaaa 
60301 tgaaaaggta tgggaaggtg taaatagaaa tctaaaaaaa cgaaatgctc tctacagtta 
60361 ccttggttaa gaagagatca tggaaagtcc tgcctctctc tttgagtttt gcttcatcca 
60421 aagagctgca ttgaaaggaa ttattcaacc tccaatgagt tatattttct ataaatcagt 
60481 agctaacaat taaactgcct aaaatcaagt agacattttc agacaaaaca aattgcgacc 
60541 taagttcctt gctcacggta tccagctttc tgactgtact gcggtactcc tttcctaaca 
60601 gtggaatgat caatggaaca ctctctttgt acctggaaag agaagggcat caagactaca 
60661 gcgaaaagta aactacaata gaaacagagg ctggaaaaat cagagttaaa acaacagtta 
60721 taccttttcc agagtagttc ttccagaaac aacctcagtt tactgatgcc aatcctactc 
60781 ttttcctgtt ttgtcagtaa acggcccaac ttcttctcta aagatgcaat gtcttccatt 
60841 tctctaagtg acacagcctg taataaaaac cacacatagt ttagaaaaag acctgtttaa 
60901 cttgtttaag gaatcagaca gcagagcaga gacctgtttg aactcgtcat tagacttata 
60961 cactgaatcc tgtccatagc caactcttcc agaaggcaca gacgtgaaaa aaggagaatc 
61021 gcccaataag gagctgtcaa gtgcgcttgc aggaggtgag agaaagactt ccacgtcaga 
61081 tgaacatgag aattgaggga ttttagtgtc aagctttgta gaaacaacaa ttgtcctaga 
61141 aagctcagga tcaacctaca tgaacgagaa acaaacttta acaaaaataa ^agacaaggtt 
61201 agacgcaatg gagttacgtc aagcaacgta cttgcatcac tatccttcga gtggttgcaa 
61261 tgctccagtc actgctatct tcgaggcata aaatgatgaa ctctttgtgt tgcatctttg 
61321 ctcggactag agcttccaca gcccgtgctt gaacctaaga aaaagaacaa gtaacccact 
61381 ctcaaataaa gcaaaaccaa aacatgaaat cagccacgga attggctgga agccataaga 
61441 aaaaacaacc tgaagagctc ggtttttcag tcctggtgca ggagcaataa gtccaggtgt 
61501 atcaatgatg gtaaggtttg gacaatactt atactggact ttcacaataa tctcctttgc 
61561 agagaatggg ctacatggct cttgctccag cctcatgttc tcagcctcaa tatatgccta 
61621 actccaaatc atataacaaa tttcgttaac atgagcattt cgcttctcta caataaacct 
61681 aagtacttgt gtttctcaac attcgtcaaa atcttcccag aatttatacg cagaaacaag 
61741 caattgaaga agcacaagta ataataataa caaaacacct gaatttgtga gagagatttg 
61801 ggaagagaaa cggaaggatc atcatcagat ccgagatgac aaagcgggaa ttgacactga 
61861 ggatcgtact tcatatggag agtaatcggc cgacgagtct tggttccgcc gccgacatgg 
61921 ttaaattgaa accccataag agcttccaca agcgcacttt taccgtcggt ctgctgtccc 
61981 accacaagaa ccgccggtgc ttcgaacggc gtctccaatt cctgcgccaa agcgtgtaac 
62041 tcgttgtaag cttcgtaaag actccaccgc tcctcaatcg cagcgtcgtc ctcttccgcc 
62101 atttcctcaa ccgtcaccga ttttgctgat acttccgcca tcgtctctta cgaaaatgag 
62161 caagaggaag agtaagagta agagagtgtc tcttatttct tctactcttt agttttcgtc 
62221 gccgttcctt tttccgccat ggaattagca gatacggcta atttcaattt ttgtcaaaag 
62281 aaatattttt tgtgttttaa tctcacgcgc atccatggcg cgttgagtca acgttgtaat 
62341 agttctccgc taaatttaaa taaaagagcg cgtaaggaga gagtttaagg attttttttt 
62401 tttggtcggc aaatacaaag gatttgcttt gtcttgacca atagtatatg cagaaatatt 
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62461 atctcaaagg atttgtgata actatgtagt acagaattgt gattattgga tgagaaacca 

62521 gaaatatttt gagcaaatga cgacttgtta atttactatt ttttcatttc ttaaaggtct 

62581 ctcttgtgta actatgatta aaattgaaat agtgactttt attgttacga catggaacaa 

62641 atcaacgagt tctattgtta aagagagaca ttgatgaatg taacaaaact gtggcttaga 

62701 agccgaaagg agacttagtt cgggtccctc cttcaccgta ttgctcgttc cattttctca 

62761 attcgttcat tgtcgtcgcg tcgtatgcca ctgacggact tacctgcaaa ttacattaca 

62821 atgacgcaat ttcgataatg caaacaccag gggaaaaaac atgaatagag atgatgatga 

62881 tgttttttaa gagattgatc aataccttag ctttggattg aatgaagtcg tccaaactca 
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SEQ ID NO: 15 



1 atggcggaag tatcagcaaa atcggtgacg gttgaggaaa tggcggaaga ggacgacgct 
61 gcgattgagg agcggtggag tctttacgaa gcttacaacg agttacacgc tttggcgcag 
121 gaattggaga cgccgttcga agcaccggcg gttcttgtgg tgggacagca gaccgacggt 
181 aaaagtgcgc ttgtggaagc tcttatgggg tttcaattta accatgtcgg cggcggaacc 
241 aagactcgtc ggccgattac tctccatatg aagtacgatc ctcagtgtca attcccgctt 
301 tgtcatctcg gatctgatga tgatccttcc gtttctcttc ccaaatctct ctcacaaatt 
361 caggcatata ttgaggctga gaacatgagg ctggagcaag agccatgtag cccattctct 
421 gcaaaggaga ttattgtgaa agtccagtat aagtattgtc caaaccttac catcattgat 
481 acacctggac ttattgctcc tgcaccagga ctgaaaaacc gagctcttca ggttcaagca 
541 cgggctgtgg aagctctagt ccgagcaaag atgcaacaca aagagttcat cattttatgc 
601 ctcgaagata gcagtgactg gagcattgca accactcgaa ggatagtgat gcaagttgat 
661 cctgagettt ctaggacaat tgttgtttct acaaagcttg acactaaaat ccctcaattc 
721 tcatgttcat ctgacgtgga agtctttctc tcacctcctg caagcgcact tgacagctcc 
781 ttattgggcg attctccttt tttcacgtct gtgccttctg gaagagttgg ctatggacag 
841 gattcagtgt ataagtctaa tgacgagttc aaacaggctg tgtcacttag agaaatggaa 
901 gacattgcat ctttagagaa gaagttgggc cgtttactga caaaacagga aaagagtagg 
961 attggcatca gtaaactgag gttgtttctg gaagaactac tctggaaaag gtacaaagag 
1021 agtgttccat tgatcattcc actgttagga aaggagtacc gcagtacagt cagaaagctg 
1081 gataccttat cgctgttatt gaagggaaca gttgtggccc ctccagataa atttggtgag 
1141 acactgcaag atgaaaggac acaaggagga gcatttgttg gtactgatgg tctccagttt 
1201 tcacataagc taataccgaa tgcagggatg cgtctctatg ggggtgcaca atatcaccgt 
1261 gccatggctg agtttcgttt tctagttggt gctatcaaat gtcccccaat aacgagggag 
1321 gaaattgtaa atgcatgtgg agttgaggat attcatgatg gaacaaacta ttccagaaca 
1381 gcttgtgtta tagcagttgc gaaggctcgt gagacgtttg aacctttcct tcatcagaaa 
1441 gttttttcca gttctcattt tcgtttgttt tgcgttgata tagttagggg cgaggcttct 
1501 acacattctc aagagattgc ttccaatttc tgtatatctt cttcaggtag gtactgtttt 
1561 ctttggtttg acggtgaata tttaagtggg catgaggtgt ttctcaagcg ggttgcttca 
1621 gcattcaaca gttttgtgga gtccacagaa aaatcatgtc gtgacaaatg tatggaggat 
1681 ttagcaagta caactcgcta tgttacatgg tctcttcaca acaagaaccg agctggtcta 
1741 cgtcaattct tggactcatt tggtggaaca gagcataata cgacatcagg taatgccata 
1801 ggatttagtc ttccccaaga tgcattaggt ggcacaacag acaccaagtc aagatcagat 
1861 gtaaagctaa gccatctcgc ctcaaacatc gattcaggtt ccagtattca gacaacagaa 
1921 atgcggttgg ctgatcttct agatagcaca ctttggaacc gcaagcttgc tccttcctct 
1981 gagagaattg tgtacgcatt ggtccaacag atattccagg gcatacgaga gtactttctc 
2041 gcctctgctg agttaaagtt caactgtttt cttctaatgc ccatcgttga taagttacct 
2101 gctcttctcc gggaagagtt ggaaaacgca tttgaagacg acctcgatag tatcttcgac 
2161 atcacgaatc tccggcaatc acttgatcaa aagaaacgga gcacagagat cgagctcaga 
2221 aggataaaga ggataaaaga gaaattcaga gtgatgaatg agaagctaaa ctctcatgaa 
2281 tttgctcaaa atctaaaggc tccttcggtg cagcattga 



/3 
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SEQ ID NO:16 

MAEVSAKSVTVEE^4AEEDDAAIEERWSLYEAyNELHALAQELETPFEAPAVLWGQQTD 
GKSALVEALMGFQFNHVGGGTKTRRPITLHMKYDPQCQFPLCHLGSDDDPSVSLPKSLS 
QIQAYIEAENMRLEQEPCSPFSAKEI IVKVQYKYCPNLTI IDTPGLIAPAPGLKNRALQ 
VQARAVEALVRAKMQHKEFIILCLEDSSDWSIATTRRIVMQVDPELSRTIWSTKLDTK 
IPQFSCSSDVEVFLSPPASALDSSLLGDSPFFTSVPSGRVGYGQDSVYKSNDEFKQAVS 
LREMEDIASLEKKLGRLLTKQEKSRIGISKLRLFLEELLWKRYKESVPLIIPLLGKEYR 
STVRKLDTLSLLLKGTWAPPDKFGETLQDERTQGGAFVGTDGLQFSHKLIPNAGMRLY 
GGAQYHRAMAEFRFLVGAIKCPPITREEIVNACGVEDIHDGTNYSRTACVIAVAKARET 
FEPFLHQKVFSSSHFRLFCVDIVRGEASTHSQEIASNFCISSSGRYCFLWFDGEYLSGH 
EVFLKRVASAFNSFVESTEKSCRDKCMEDLASTTRYVTWSLHNKNRAGLRQFLDSFGGT 
EHNTTSGNAIGFSLPQDALGGTTDTKSRSDVKLSHLASNIDSGSS IQTTEMRLADLLDS 
TLWNRKLAPSSERIVYALVQQIFQGIREYFLASAELKFNCFLLMPIVDKLPALLREELE 
NAFEDDLDSIFDITNLRQSLDQKKRSTEIELRRIKRIKEKFRVMNEKLNSHEFAQNLKA 
PSVQH 



Fig. 15 
SEQ ID NO:17 

MQELYTNRTVLNRPRFAVNVRPTRLKRNQQSQSKMQSHSKDPIN 

AESRSRFEAYNRLQAAAVAFGEKLPIPEIVAIGGQSDGKSSLLEALLGFRFNVREVEM 

GTRRPLILQMVHDLSALEPRCRFQISRIFFVELAILITDLDEDSEEYGSPIVSATAVA 

DVIRSRTEALLKKTKTAVSPKPIVMRAEYAHCPNLTIIDTPGFVLKAKKGEPETTPDE 

ILSMVKSLASPPHRILLFLQQSSVEWCSSLWLDAVREIDSSFRRTIVWSKFDNRLKE 

FSDRGEVDRYLSASGYLGENTRPYFVALPKDRSTISNDEFRRQISQVDTEVIRHLREG 

VKGGFDEEKFRS'CIGFGSLRDFLESELQKRYKEAAPATLALLEERCSrVTD^ 

KIQATSDVAHLRKAAMLYTASISNHVGALIDGAANPAPEQWGKTTEEERGESGIGSWP 

GVSVDIKPPNAVLKLYGGAAFERVIHEFRCAAYSIECPPVSREKVANILLAHAGRGGG 

RGVTEASAEIARTAARSWLAPLLDTACDRLAFVLGSLFEIALERNLNQNSEYEKKTEN 

MDGYVGFHAAVRNCYSRFVKNLAKQCKQLVRHHLDSVTSPYSMACYENNYHQGGAFGA 

YNKFNQASPNSFCFELSDTSRDEPMKDQENIPPEKNNGQETTPGKGGESHITVPETPS 

PDQPCEIVYGLVKKEIGNGPDGVGARKRMARMVGNRNIEPFRVQNGGLMFANADNGMK 

SSSAYSEICSSAAQHFARIREVLVERSVTSTLNSGFLTPCRDRLWALGLDLFAVNDD 

KFMDMFVAPGAIWLQNERQQLQKRQKILQSCLTEFKTVARSL" 
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SEQ ID NO:18 

MANSNTYLTTPTKTPSSRRNQQSQSKMQSHSKDPINAESRSRFEAYNRLQAAAVAFGEK 
LP I PE I VAI GGQSDGKS SLLEALLGFRFNVREVEMGTRRPL I LQMVHDLSALEPRCRFQ 
DEDS EE YGS P I VS ATAVADVI RSRTEALLKKTKTAVS PKP I VMRAE YAHCPNLT 1 1 DT P 
GFVLKAKKGEPETTPDE I LSMVKSLAS PPHRI LLFLQQS S VEWCSSLWLDAVRE IDS S F 
RRTIVWSKFDNRLKEFSDRGEVDRYLSASGYLGENTRPYFVALPKDRSTISNDEFRRQ 
I SQVDTEVI RHLREGVKGGFDEEKFRSCI GFGSLRDFLESELQKRYKEAAPATLALLEE 
RCSEVTDDMLRMDMKIQATSDVAHLRKAAMLYTAS I SNHVGALIDGAANPAPEQWGKTT 
EEERGESGIGSWPGVSVDIKPPNAVLKLYGGAAFERVIHEFRCAAYSIECPPVSREKVA 
NILLAHAGRGGGRGVTEASAEIARTAARSWLAPLLDTACDRLAFVLGSLFEIALERNLN 
QNSEYEKKTENMDGYVGFHAAVRNCYSRFVKNLAKQCKQLVRHHLDSVTSPYSMACYEN 
NYHQGGAFGAYNKFNQASPNSFCFELSDTSRDEPMKDQENIPPEKNNGQET-TPGKGGES 
HI TVPETPS PDQPCE I VYGLVKKEI GNGPDGVGARKRMARMVGNRNI EPFRVQNGGLMF 
ANADNGMKSSSAYSEICSSAAQHFARIREVLVERSVTSTLNSGFLTPCRDRLWALGLD 
LFAVNDDKFMDMFVAPGAIWLQNERQQLQKRQKILQSCLTEFKTVARSL 
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Fig. 17 
SEQ ID NO:19 



1 ttcatgttct tagaagttct aaattttgat 
61 gctatgatat cattccctga tgctacgtac 
121 tataattaaa acttgttaaa ttcatacaca 
181 gatggaggct ctggaacatc tagtggtgcc 
241 tgtctttgtt tggaagaagt aaatttaatt 
301 gtttttctcc cttttcgtgg tatactttgg 
361 cgttaattca tgtgtttgaa aagtaattaa 
421 ggccaatagg atatttaaga gataagaaaa 
481 ctctctctct ttctctctcc ATGAGAACTC 
541 CGTTTCTTAT CTCCGCCGCA TCTCCACCGT 
601 TTACTCCTCC ACGTCATAGG CGTTTTTCTT 
661 CCGCCGATCA GACTTCTTCT TCTAGGCCGC 
721 CCGAACTCGC CGTTCCCGGT TTACTTCTCC 
781 ATCGTGAAGA GACTCTTGAT TTGGTCGACC 
841 TGATTGATGG CGGAGCCACC GCTGGTAAGC 
901 TTGTCAAAGG CCGTGCTTAC CTCTTGATCG 
961 GTGCTAGTGG TGTTGCTCTC TCCGACGAAG 
1021 ttaatttctc atagagtgag ttttgtctct 
1081 TGGCGAGAAA CACATTGATG GGATCCAACC 
1141 GGATTGTGAA GGATGTTGAT TCTGCTCTAA 
1201 TTATACTTGG ATCTGGTGAA GAAGATACGC 
1261 AAATACCGAT ATATGTGACT TGCAGAGGCA 
1321 TGAAATCAGG TGTTTCTGGT TTTGTTATTT 
1381 TAGCTCTTCG CCAGAGTCTT GATGGAGCTT 
1441 TGAATGAACT GCCGGAGAAA AAGAATTCTG 
1501 AACTAATAGT AGAAATGGAG AAATCTGTGT 
1561 CGGCTCCACT Ggtgattttt atttcaaaca 
1621 gttctaagta ggtttttgtg tggttataat 
1681 cattaacagA TGGAGGAAGT CTCCCTTCTA 
1741 TTTCTGATGG TTATAGTGgt aattctgcac 
1801 ttgcattggt attagctcta tattcattcc 
1861 cactagatag cttgagatac aatgggcatg 
1921 aatatcttct ttcgtcgcct atgactatga 
1981 cttctcttct taatttgctt atggatctgg 
2041 accatctgct tgtgtacata gttttttcgc 
2101 ggaagatgtt ttaagtggga caagttgcct 
2161 attaattgga atccacaatt tgctggtact 
2221 TTATCAATGC ACTTCTTGGG AAGAGATACC 
2281 AAATCACGTT TCTGTGCTAC TCTGACTTGG 
2 341 ATCCAGATGG CCAATATGTA TGCTATCTTC 
2401 caaaattcta ccatcgcagt cctgaatttt 
2461 tgttctcctt tcgagcagAT AAATATTGTT 
2521 AGGCAACAGC GTCTTACAGA AGAATTTGTT 
2581 TCTGCTGACC GCCCTTTAAC TGAAAGTGAG 
2641 gttgttgttg tttttgctca atatgtatct 
2701 cgaaagtagt tagttaagtc atgtatagac 
2761 tgtcactagg ttgaatgcat atatcaaggt 
2821 tttattttca aagtaatgag tgttatagct 
2881 atattttggt aaagcttagg ccaatacatt 
2941 actgatttta cgtccatggc aaattgtatg 



catctcttat ttgaaagctc aactaaaata 
taggttttta aattcataca cacacaaatc 
caaaggacaa atcttcttcg tattaaaaaa 
gtatcactta cttgactggt tcaagccgtt 
gtgggagagg gatttcacga atttaaatct 
accttttgga tatgaacaca tatgtgaaaa 
tcgcgccgtc cgtcttatag ctttgggatg 
ctaatcagaa acacagacga aggtatctca 
TAATCTCTCA CCGGCAATGT GTGACGTCAC 
TTCCTGGCCG GTGCTTTAAG TTATCCTCCT 
CTCTCTCGAT CAGAT^CATT TCGCATGAAT 
GAACTCTTTA TCCTGGTGGT TACAAGCGTC 
GGCTAGACGC CGACGAGGTT ATGAGCGGGA 
GTGCTTTAGC TAAATCGGTT CAAATCGTCG 
TCTACGAGGC GGCTTGTTTG CTGAAATCAC 
CTGAACGTGT TGATATCGCC TCCGCCGTTG 
gtaacaactg atttcattca gttttagcat 
caatgctatg tacagGTCTT CCGGCGATTG 
CCGACTCGGT ACTTCTTCCA CTGGTAGCTC 
TTGCCTCAAG CTCCGAGGGT GCTGATTTCC 
AAGTGGCGGA TTCTTTGTTG AAGAGCGTGA 
ATGAAGAAGC TAAAGAAGAA TTGCAGTTAC 
CGTTGAAAGA TTTGCGTTCT TCTAGGGATG 
ATGTTGTAAA TAATCATGAG ACACAAAATA 
CTGGCTTCAT AAAATTAGAG GACAAACAGA 
TGAGAGAGAC GATTGAAATC ATCCACAAGG 
tttggtagtt gaagtcaatt ttttgaaatg 
atggtttcat ttacttcttc gactattttt 
ATTGATGCTG TTTCTCGGAT CGATGAGCCG 
tcaactccgt caaattgtga ttccaggaat 
agaaacattt tagttacaca cttttgccag 
cttctagtca cttgtccttt agtgcttctc 
tgtttcgctt cttcttttgt tctgtctatg 
ttgtaaggga actgcatatt tcttaactgt 
tttcttgtga cttgtgagta tgccgttctt 
ttatgattca aaatagtttt tgtatggata 
agGGGGAATT TAACTCTGGA AAATCAACGG 
TGAAAGAAGG GGTAGTCCCC ACTACCAATG 
AATCCGAAGA GCAACAACGT TGCCAAACAC 
CTGCACCAAT ACTTAAGGAT gtgagtaatt 
tactaattat ttggaggaat tgatttgggt 
GACACACCTG GGACCAATGT GATCCTTCAA 
CCACGTGCAG ATTTGCTTGT TTTTGTTCTT 
gtagaagtta ccgttttact tggcatgtta 
gcctaagtag cttgttagat ctatttttca 
catcaagacc ttgtgtaggg aagggaaagt 
tttgttgatt ataaatttaa actagactaa 
attgctggaa ccagtatgtc ctgttggtcc 
tgagaggtga gttgttattg gtacagcaaa 
taaatgatca tctacgaagt actaacctta 



Fig. 17, continued 2/2 



3001 tgaatatttg gttcttattt tgaaaatctg aaaaagtttc aaaagaagga ataagcttct 
3061 caatgtcatc atacccatgt catttctatc tctacctctg gagcttcctg ctgtcttgat 
3121 tttactgtag gctgatttac atctcattgc gtttgtcagG TTGCGTTTCT CCGGTACACA 
3181 CAGCAGTGGA AAAAGAAATT TGTGTTTATT CTGAATAAAT CTGATATCTA TCGTGATGCT 
3241 CGTGAGgttt atcagaaaca atatttatgt cttttccttg atagtctctg taattgctgg 
3301 atttttcttg actaaagatt aattttactg ctgcagCTTG AGGAAGCTAT TTCATTTGTT 
3361 AAAGAGAATA CACGGAAGTT GCTTAATACA GAAAATGTGA TATTGTATCC GGTGTCCGCA 
3421 CGGTCTGCTC TTGAGGCGAA GCTTTCAACA GCTTCTTTGG TTGGCAGAGA TGATCTTGAG 
3481 ATCGCAGATC CTGGTTCTAA TTGGAGAGTC CAGAGCTTCA ATGAACTTGA GAAATTTCTT 
3 541 TATAGCTTCT TGGATAGCTC AACAGCTACC GGGATGGAGA GAATAAGGCT TAAATTGGAG 
3601 ACACCCATGG CGATTGCTGA GCGTCTCCTT TCTTCTGTGG AAGCTCTTGT GAGACAAGAT 
3661 TGCCTAGCTG CTAGGGAAGA CTTGGCTTCA GCAGACAAGA TTATCAGTCG AACTAAAGAA 
3 721 TACGCGCTTA AGATGGAATA TGAGAGCATT TCTTGGAGAA GGCAGGCTCT CTCGTTGGTA 
3781 T-AAatfectat-tagatattat cttgttgaat cacgaaggag gaaattggat tgttctaa.ct' 
3 841 tggctttttt gtgttttgta ctctggcttt tatcgcagat tgataatgcc agattacaag 
3 901 ttgttgatct gataggaact accctgcgac tatcaagcct tgatcttgcg atctcgtacg 

3 961 tgttcaaagg ggaaaaatcg gcctcagtag cagctacatc caaagttcaa ggtgaaatac 

4 021 tcgctccagc actcacaaat gcgaaagtaa gtgtgatgct ttattctttg agtattggcc 
4 081 taactgggga catgttggtc atatatatga ggtctgagat atagtcacta ttcatgcaga 
4141 aagtaaatat tgtctaacaa tgtcttgttg tgacctgatt gactttacat ttcactgttt 
4201 gcaggaattg cttggaaaat atgctgaatg gctacaatca aatactgccc gtgaagggag 
4261 tctgtctctg aaatcattcg aaa 
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SEQ ID NO:20 



1 ATGAGAACTC TAATCTCTCA CCGGCAATGT 
61 TCTCCACCGT TTCCTGGCCG GTGCTTTAAG 
121 CGTTTTTCTT CTCTCTCGAT CAGAAACATT 
181 TCTAGGCCGC GAACTCTTTA TCCTGGTGGT 
241 TTACTTCTCC GGCTAGACGC CGACGAGGTT 
301 TTGGTCGACC GTGCTTTAGC TAAATCGGTT 
361 GCTGGTAAGC TCTACGAGGC GGCTTGTTTG 
421 CTCTTGATCG CTGAACGTGT TGATATCGCC 
481 TCCGACGAAG GTCTTCCGGC GATTGTGGCG 
541 TCGGTACTTC TTCCACTGGT AGCTCGGATT 
601 TCAAGCTCCG AGGGTGCTGA TTTCCTTATA 
661 GCGGATTCTT TGTTGAAGAG CGTGAAAATA 
721 GAAGCTAAAG AAGAATTGCA GTTACTGAAA 
781 AAAGATTTGC GTTCTTCTAG GGATGTAGCT 
841 GTAAATAATC ATGAGACACA AAATATGAAT 
901 TTCATAAAAT TAGAGGACAA ACAGAAACTA 
961 GAGACGATTG AAATCATCCA CAAGGCGGCT 
1021 GATGCTGTTT CTCGGATCGA TGAGCCGTTT 
1081 GGAAAATCAA CGGTTATCAA TGCACTTCTT 
1141 CCCACTACCA ATGAAATCAC GTTTCTGTGC 
1201 CGTTGCCAAA CACATCCAGA TGGCCAATAT 
12 61 GATATAAATA TTGTTGACAC ACCTGGGACC 
1321 ACAGAAGAAT TTGTTCCACG TGCAGATTTG 
1381 TTAACTGAAA GTGAGGTTGC GTTTCTCCGG 
1441 TTTATTCTGA ATAAATCTGA TATCTATCGT 
1501 TTTGTTAAAG AGAATACACG GAAGTTGCTT 
1561 TCCGCACGGT CTGCTCTTGA GGCGAAGCTT 
1621 CTTGAGATCG CAGATCCTGG TTCTAATTGG 
1681 TTTCTTTATA GCTTCTTGGA TAGCTCAACA 
1741 TTGGAGACAC CCATGGCGAT TGCTGAGCGT 
1801 CAAGATTGCC TAGCTGCTAG GGAAGACTTG 
1861 AAAGAATACG CGCTTAAGAT GGAATATGAG 
1921 TTGGTATAA 



GTGACGTCAC CGTTTCTTAT CTCCGCCGCA 
TTATCCTCCT TTACTCCTCC ACGTCATAGG 
TCGCATGAAT CCGCCGATCA GACTTCTTCT 
TACAAGCGTC CCGAACTCGC CGTTCCCGGT 
ATGAGCGGGA ATCGTGAAGA GACTCTTGAT 
CAAATCGTCG TGATTGATGG CGGAGCCACC 
CTGAAATCAC TTGTCAAAGG CCGTGCTTAC 
TCCGCCGTTG GTGCTAGTGG TGTTGCTCTC 
AGAAACACAT TGATGGGATC CAACCCCGAC 
GTGAAGGATG TTGATTCTGC TCTAATTGCC 
CTTGGATCTG GTGAAGAAGA TACGCAAGTG 
CCGATATATG TGACTTGCAG AGGCAATGAA 
TCAGGTGTTT CTGGTTTTGT TATTTCGTTG 
CTTCGCCAGA GTCTTGATGG AGCTTATGTT 
GAACTGCCGG AGAAAAAGAA TTCTGCTGGC 
ATAGTAGAAA TGGAGAAATC TGTGTTGAGA 
CCACTGATGG AGGAAGTCTC CCTTCTAATT 
CTGATGGTTA TAGTGGGGGA ATTTAACTCT 
GGGAAGAGAT ACCTGAAAGA AGGGGTAGTC 
TACTCTGACT TGGAATCCGA AGAGCAACAA 
GTATGCTATC TTCCTGCACC AATACTTAAG 
AATGTGATCC TTCAAAGGCA ACAGCGTCTT 
CTTGTTTTTG TTCTTTCTGC TGACCGCCCT 
TACACACAGC AGTGGAAAAA GAAATTTGTG 
GATGCTCGTG AGCTTGAGGA AGCTATTTCA 
AATACAGAAA ATGTGATATT GTATCCGGTG 
TCAACAGCTT CTTTGGTTGG CAGAGATGAT 
AGAGTCCAGA GCTTCAATGA ACTTGAGAAA 
GCTACCGGGA TGGAGAGAAT AAGGCTTAAA 
CTCCTTTCTT CTGTGGAAGC TCTTGTGAGA 
GCTTCAGCAG ACAAGATTAT CAGTCGAACT 
AGCATTTCTT GGAGAAGGCA GGCTCTCTCG 
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SEQ ID NO:21 



MRTLISHRQC VTSPFLISAA 
SRPRTLYPGG YKRPELAVPG 
AGKLYEAACL LKSLVKGRAY 
SVLLPLVARI VKDVDSALIA 
EAKEELQLLK SGVSGFVISL 
FIKLEDKQKL IVEMEKSVLR 
GKSTVINALL GKRYLKEGW 
DINIVDTPGT NVILQRQQRL 
FILNKSDIYR DARELEEAIS 
LEIADPGSNW RVQSFNELEK 
QDCLAAREDL ASADKIISRT 



SPPFPGRCFK LSSFTPPRHR 
LLLRLDADEV MSGNREETLD 
LLIAERVDIA SAVGASGVAL 
SSSEGADFLI LGSGEEDTQV 
KDLRSSRDVA LRQSLDGAYV 
ETIEIIHKAA PLMEEVSLLI 
PTTNEITFLC YSDLESEEQQ 
TEEFVPRADL LVFVLSADRP 
FVKENTRKLL NTENVILYPV 
FLYSFLDSST ATGMERIRLK 
KEYALKMEYE SISWRRQALS 



RFSSLSIRNI SHESADQTSS 
LVDRALAKSV QIWIDGGAT 
SDEGLPAIVA RNTLMGSNPD 
ADSLLKSVKI PIYVTCRGNE 
VNNHETQNMN ELPEKKNSAG 
DAVSRIDEPF LMVIVGEFNS 
RCQTHPDGQY VCYLPAPILK 
LTESEVAFLR YTQQWKKKFV 
SARSALEAKL STASLVGRDD 
LETPMAIAER LLSSVEALVR 
LV 
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SEQ ID NO:22 



1 actgtcacaa agaactagaa aaggcaagca aaactcaact atgtcaaaag tgtcacttag 
61 attgattctt gaatagcgag acgaagtatc tgggaaaata cggtactgaa ttaacatctc 
121 cgtcagatca taggttcgga ttgaacagat gacacaatta aacaatgatg aagatcaaga 
181 cactttaatc gactgaattc tagttagaac ttagactaaa agtatttaat acttgaagct 
241 caccacttct cgaatatctt gttccaatcg ttttgatgtg gttccggcac tcaagttctg 
301 tattgttttc aagctgactt tatcagtttt ctgaagtaag tcatatgtgt ctatgcccaa 
361 ttgcgttttt gaattgacat atgttggcca tttgttttcg aatgatttca gagacagact 
421 cccttcacgg gcagtatttg attgtagcca ttcagcatat tttccaagca attcctgcaa 
481 acagtgaaat gtaaagtcaa tcaggtcaca acaagacatt gttagacaat atttactttc 
541 tgcatgaata gtgactatat ctcagacctc atatatatga ccaacatgtc cccagttagg 
601 ccaatactca aagaataaag catcacactt actttcgcat ttgtgagtgc tggagcgagt 
661 atttcacctt gaactttgga" tgtagctgct actgaggccg atttttcccc tttgaacacg 
721 tacgagatcg caagatcaag gcttgatagt cgcagggtag ttcctatcag atcaacaact 
781 tgtaatctgg cattatcaat ctgcgataaa agccagagta caaaacacaa aaaagccaag 
841 ttagaacaat ccaatttcct ccttcgtgat tcaacaagat aatatctaat agaatttata 
901 ccaacgagag agcctgcctt ctccaagaaa tgctctcata ttccatctta agcgcgtatt 
961 ctttagttcg actgataatc ttgtctgctg aagccaagtc ttccctagca gctaggcaat 
1021 cttgtctcac aagagcttcc acagaagaaa ggagacgctc agcaatcgcc atgggtgtct 
1081 ccaatttaag ccttattctc tccatcccgg tagctgttga gctatccaag aagctataaa 
1141 gaaatttctc aagttcattg aagctctgga ctctccaatt agaaccagga tctgcgatct 
1201 caagatcatc tctgccaacc aaagaagctg ttgaaagctt cgcctcaaga gcagaccgtg 
1261 cggacaccgg atacaatatc acattttctg tattaagcaa cttccgtgta ttctctttaa 
1321 caaatgaaat agcttcctca agctgcagca gtaaaattaa tctttagtca agaaaaatcc 
1381 agcaattaca gagactatca aggaaaagac ataaatattg tttctgataa acctcacgag 
1441 catcacgata gatatcagat ttattcagaa taaacacaaa tttctttttc cactgctgtg 
1501 tgtaccggag aaacgcaacc tgacaaacgc aatgagatgt aaatcagcct acagtaaaat 
1561 caagacagca ggaagctcca gaggtagaga tagaaatgac atgggtatga tgacattgag 
1621 aagcttattc cttcttttga aactttttca gattttcaaa ataagaacca aatattcata 
1681 aggttagtac ttcgtagatg atcatttaca tacaatttgc catggacgta aaatcagttt 
1741 tgctgtacca ataacaactc acctctcaaa tgtattggcc taagctttac caaaatatgg 
1801 accaacagga catactggtt ccagcaatag ctataacact cattactttg aaaataaatt 
1861 agtctagttt aaatttataa tcaacaaaac cttgatatat gcattcaacc tagtgacaac 
1921 tttcccttcc ctacacaagg tcttgatggt ctatacatga cttaactaac tactttcgtg 
1981 aaaaatagat ctaacaagct acttaggcag atacatattg agcaaaaaca acaacaacta 
2041 acatgccaag taaaacggta acttctacct cactttcagt taaagggcgg tcagcagaaa 
2101 gaacaaaaac aagcaaatct gcacgtggaa caaattcttc tgtaagacgc tgttgccttt 
2161 gaaggatcac attggtccca ggtgtgtcaa caatatttat ctgctcgaaa ggagaacaac 
2221 ccaaatcaat tcctccaaat aattagtaaa aattcaggac tgcgatggta gaattttgaa 
2281 ttactcacat ccttaagtat tggtgcagga agatagcata catattggcc atctggatgt 
2341 gtttggcaac gttgttgctc ttcggattcc aagtcagagt agcacagaaa cgtgatttca 
2401 ttggtagtgg ggactacccc ttctttcagg tatctcttcc caagaagtgc attgataacc 
2461 gttgattttc cagagttaaa ttccccctag taccagcaaa ttgtggattc caattaatta 
2521 tccatacaaa aactattttg aatcataaag gcaacttgtc ccacttaaaa catcttccaa 
2581 gaacggcata ctcacaagtc acaagaaagc gaaaaaacta tgtacacaag cagatggtac 
2641 agttaagaaa tatgcagttc ccttacaacc agatccataa gcaaattaag aagagaagca 
2701 tagacagaac aaaagaagaa gcgaaacatc atagtcatag gcgacgaaag aagatattga 
2761 gaagcactaa aggacaagtg actagaagca tgcccattgt atctcaagct atctagtgct 
2821 ggcaaaagtg tgtaactaaa atgtttctgg aatgaatata gagctaatac caatgcaaat 
2881 tcctggaatc acaatttgac ggagttgagt gcagaattac cactataacc atcagaaacg 
2941 gctcatcgat ccgagaaaca gcatcaatta gaagggagac ttcctccatc tgttaatgaa 

1^3 



Fig. 20, continiiied 2/2 



3001 aaatagtcga agaagtaaat gaaaccatat tataaccaca caaaaaccta cttagaacca 
3061 tttcaaaaaa ttgacttcaa ctaccaaatg tttgaaataa aaatcaccag tggagccgcc 
3121 ttgtggatga tttcaatcgt ctctctcaac acagatttct ccatttctac tattagtttc 
3181 tgtttgtcct ctaattttat gaagccagca gaattctttt tctccggcag ttcattcata 
3241 ttttgtgtct catgattatt tacaacataa gctccatcaa gactctggcg aagagctaca 
3301 tccctagaag aacgcaaatc tttcaacgaa ataacaaaac cagaaacacc tgatttcagt 
3361 aactgcaatt cttctttagc ttcttcattg cctctgcaag tcacatatat cggtattttc 
3421 acgctcttca acaaagaatc cgccacttgc gtatcttctt caccagatcc aagtataagg 
3481 aaatcagcac cctcggagct tgaggcaatt agagcagaat caacatcctt cacaatccga 
3541 gctaccagtg gaagaagtac cgagtcgggg ttggatccca tcaatgtgtt tctcgccaca 
3601 atcgccggaa gacctgtaca tagcattgag agacaaaact cactctatga gaaattaaat 
3661 gctaaaactg aatgaaatca gttgttacct tcgtcggaga gagcaacacc actagcacca 
3721 acggcggagg cgatatcaac acgttcagcg atcaagaggt aagcacggcc tttgacaagt 
37 81 gatttcagca aacaagccgc ctcgtagagc ttaccagcgg tggctccgcc atcaatcacg 
3 841 acgatttgaa ccgatttagc taaagcacgg tcgaccaaat caagagtctc ttcacgattc 
3901 ccgctcataa cctcgtcggc gtctagccgg agaagtaaac cgggaacggc gagttcggga 
3 961 cgcttgtaac caccaggata aagagttcgc ggcctagaag aagaagtctg atcggcggat 
4021 tcatgcgaaa tgtttctgat cgagagagaa gaaaaacgcc tatgacgtgg aggagtaaag 
4081 gaggataact taaagcaccg gccaggaaac ggtggagatg cggcggagat aagaaacggt 
4141 gacgtcacac attgccggtg agagattaga gttctcatgg agagagaaag agagagagtg 
4201 agataccttc gtctgtgttt ctgattagtt ttcttatctc ttaaatatcc tattggccca 
4261 tcccaaagct ataagacgga cggcgcgatt aattactttt caaacacatg aattaacgtt 
4321 ttcacatatg tgttcatatc caaaaggtcc aaagtatacc acgaaaaggg agaaaaacag 
43 81 atttaaattc gtgaaatccc tctcccacaa ttaaatttac ttcttccaaa caaagacaaa 
4441 cggcttgaac cagtcaagta agtgatacgg caccactaga tgttccagag cctccatctt 
4501 ttttaatacg aagaagattt gtcctttgtg tgtatgaatt taacaagttt taattataga 
4561 tttgtgtgtg tatgaattta aaaacctagt acgtagcatc agggaatgat atcatagcta 
4621 ttttagttga gctttcaaat aagagatgat caaaatttag aacttctaag aacatgaacg 
4681 aataaacaac tattttcttt tcaaaccaac taaggtagat ggtcactgaa agtatataca 
4741 tcagataaaa gttgcttgtt attccagatg aagttggacc gagaaaaaaa aaagttactt 
4801 gttattcaat atgtttggat ctttgtcttg cagattgcta tatagggttg ataatgggct 
4861 tcgttgtaat gggtatacag tgtataagaa tcggccttgt gcaaccaatc ctaatatgtg 
4921 tgtctcatta aggtaagtgc ttaagattag aagagtaaaa cacttgactt atcaactatg 
4981 tcaactaagg gttctatatt tttattaaat aaaaaataat tgaatatttt ttagaatgat 
5041 ttaataaatt taatgctatt gtttgattta aatgtataat tcaccgcgag 'aagaaatttt 
5101 ataactcaaa ttttaaagtt ttaagttgta tttgtttatt ttgttaaatg tttaatattg 
5161 tataattgta ttttgattgt tgtttctcgg atttcacccg tagtacatca tcccatatta 
5221 atatcgaatc aaacccgtca attctaaaat ttcacccgtg gtagtattta attgtataat 
5281 tatattttaa ttgtcattct aagatttcac tcctaattct atcgcaaatt attatcaacc 
5341 caaaccagtc aattctaaaa tatcacccgt agtacaccat cccatattaa tatcgaatca 
5401 agcccgtcaa ttctaggatt tcacccgtgg tagtatttaa ttgtataatt atattttaat 
5461 tgtcattcta ggatttcact cctaattcta tcgcaaatta ttatcaaccc aaaccagtca 
5521 attctaaaat atcacccgta gtacaccatc ccatattaat atcgattcaa actcgtcaat 
5581 tctaggattt cgctcgtggt agtatttaat tgtataatta tattttaatt gtcattttaa 
5641 ctcctagttc tatcgcaaat tcttatcaac ccaaacagtc aattctaaaa tttcacccgt 
5701 agtataaagt ttaaatattt ataatattta aatttcttat aaaagaatca aaatgtgttt 
5761 taaaaaaatt aaagttttaa gttttttttt tttaatattg ttaattttgt ttagtgttta 
5821 agattatata attacattat gattgtcatt atatgttttt ctccatagca tactatccca 
5881 tgttattatc cactcaaacc tgtcacacca tataaccccg tcccgtgaaa ttaaacacaa 
5941 atttgtcatt ttattataaa tttcaaatat ttataaaatt agaaacttca aaaaagatta 
6001 atattgaccc aaacttcatc attgaatttt gagtgttata tctaagattt ctctcgcaat 




Fig. 21 



SEQ ID NO:23 



1 atggaggctc tggaacatct agtgctttgg 
61 aaaactaatc agaaacacag acgaaggtat 
121 actctaatct ctcaccggca atgtgtgacg 
181 ccgtttcctg gccggtgctt taagttatcc 
241 tcttctctct cgatcagaaa catttcgcat 
3 01 ccgcgaactc tttatcctgg tggttacaag 
361 ctccggctag acgccgacga ggttatgagc 
421 gaccgtgctt tagctaaatc ggttcaaatc 
481 aagctctacg aggcggcttg tttgctgaaa 
541 atcgctgaac gtgttgatat cgcctccgcc 
- 601 gaaggtcttc cggcgattgt ggcgagaaac 
661 cttcttccac tggtagctcg gattgtgaag 
721 tccgagggtg ctgatttcct tatacttgga 
781 tctttgttga agagcgtgaa aataccgata 
841 aaagaagaat tgcagttact gaaatcaggt 
901 ttgcgttctt ctagggatgt agctcttcgc 
961 aatcatgaga cacaaaatat gaatgaactg 
1021 aaattagagg acaaacagaa actaatagta 
1081 attgaaatca tccacaaggc ggctccactg 
1141 gtttctcgga tcgatgagcc gtttctgatg 
1201 tcaacggtta tcaatgcact tcttgggaag 
1261 accaatgaaa tcacgtttct gtgctactct 
1321 caaacacatc cagatggcca atatataaat 
1381 cttcaaaggc aacagcgtct tacagaagaa 
1441 gttctttctg ctgaccgccc tttaactgaa 
1501 atggaaggga aagttgtcac taggttgaat 
1561 acacagcagt ggaaaaagaa atttgtgttt 
1621 gctcgtgagc ttgaggaagc tatttcattt 
1681 acagaaaatg tgatattgta tccggtgtcc 
1741 acagcttctt tggttggcag agatgatctt 
1801 gtccagagct tcaatgaact tgagaaattt 
1861 accgggatgg agagaataag gcttaaattg 
1921 ctttcttctg tggaagctct tgtgagacaa 
1981 tcagcagaca agattatcag tcgaactaaa 
2041 atttcttgga gaaggcaggc tctctcgttg 
2101 ctgataggaa ctaccctgcg actatcaagc 
2161 ggggaaaaat cggcctcagt agcagctaca 
2221 gcactcacaa atgcgaaaga attgcttgga 
2281 gcccgtgaag ggagtctgtc tctgaaatca 
2 341 tcaaaaacgc aattgggcat agacacatat 
24 01 ttgaaaacaa tacagaactt gagtgccgga 
2461 cgagaagtg 



gatgggccaa taggatattt aagagataag 
ctcactctct ctctttctct ctccatgaga 
tcaccgtttc ttatctccgc cgcatctcca 
tcctttactc ctccacgtca taggcgtttt 
gaatccgccg atcagacttc ttcttctagg 
cgtcccgaac tcgccgttcc cggtttactt 
gggaatcgtg aagagactct tgatttggtc 
gtcgtgattg atggcggagc caccgctggt 
tcacttgtca aaggccgtgc ttacctcttg 
gttggtgcta gtggtgttgc tctctccgac 
acattgatgg gatccaaccc cgactcggta 
gatgttgatt ctgctctaat tgcctcaagc 
tctggtgaag aagatacgca agtggcggat 
tatgtgactt gcagaggcaa tgaagaagct 
gtttctggtt ttgttatttc gttgaaagat 
cagagtcttg atggagctta tgttgtaaat 
ccggagaaaa agaattctgc tggcttcata 
gaaatggaga aatctgtgtt gagagagacg 
atggaggaag tctcccttct aattgatgct 
gttatagtgg gggaatttaa ctctggaaaa 
agatacctga aagaaggggt agtccccact 
gacttggaat ccgaagagca acaacgttgc 
attgttgaca cacctgggac caatgtgatc 
tttgttccac gtgcagattt gcttgttttt 
agtgaggtag aagttaccgt tttacttggc 
gcatatatca aggttgcgtt tctccggtac 
attctgaata aatctgatat ctatcgtgat 
gttaaagaga atacacggaa gttgcttaat 
gcacggtctg ctcttgaggc gaagctttca 
gagatcgcag atcctggttc taattggaga 
ctttatagct tcttggatag ctcaacagct 
gagacaccca tggcgattgc tgagcgtctc 
gattgcctag ctgctaggga agacttggct 
gaatacgcgc ttaagatgga atatgagagc 
attgataatg ccagattaca agttgttgat 
cttgatcttg cgatctcgta cgtgttcaaa 
tccaaagttc aaggtgaaat actcgctcca 
aaatatgctg aatggctaca atcaaatact 
ttcgaaaaca aatggccaac atatgtcaat 
gacttacttc agaaaactga taaagtcagc 
accacatcaa aacgattgga acaagatatt 
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SEQ ID NO:24 



MEALEHLVLWDGP IG YLRDKKTNQKHRRRYLTLS LSLSMRTL I SHRQCVTS PFL I S AAS PPFPGRCFKLS S 
FTPPRHRRFSSLSIRNISHESADQTSSSRPRTLYPGGYKRPELAVPGLLLRLDADEVMSGNREETLDLVDR 
ALAKSVQIWIDGGATAGKLYEAACLLKSLVKGRAYLLIAERVDIASAVGASGVALSDEGLPAIVARNTLM 
GSNPDSVLLPLVARIVKDVDSALIASSSEGADFLILGSGEEDTQVADSLLKSVKIPIYVTCRGNEEAKEEL 
QLLKSGVSGFVISLKDLRSSRDVALRQSLDGAYVVNNHETQNMNELPEKKNSAGFIKLEDKQKLIVEMEKS 
VLRETIEIIHKAAPLMEEVSLLIDAVSRIDEPFLMVIVGEFNSGKSTVINALLGKRYLKEGWPTTNEITF 
LCYSDLESEEQQRCQTHPDGQYINIVDTPGTNVILQRQQRLTEEFVPRADLLVFVLSADRPLTESEVEVTV 
LLGMEGKVVTRLNAYIKVAFLRYTQQWKKKFVFILNKSDIYRDARELEEAISFVKENTRKLLNTENVILYP 
VSARSALEAKLSTASLVGRDDLEIADPGSNWRVQSFNELEKFLYSFLDSSTATGMERIRLKLETPMAIAER 
LLSSVEALVRQDCLAAREDLASADKIISRTKEYALKMEYESISWRRQALSLIDNARLQWDLIGTTLRLSS 
LDLAISYVFKGEKSASVAATSKVQGEILAPALTNAKELLGKYAEWLQSNTAREGSLSLKSFENKWPTYVNS 
KTQLGIDTYDLLQKTDKVSLKTiQNLSAGTTSKRLEQDIREV- _ . _ 
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SEQ ID NO:25 



69061 acaaagacca gttaaaaacg tgtgtagtat aacttactgg taagtaaagc tataagcaag 
69121 aatctgtacc ttattttctc tctctctagt gagccctgac catccgaatt tcgcattcgc 
69181 caatcgctgt gtttccgtgt gttttccccc tttttggttt tagatttgcc taaaccaatc 
69241 agaacaagag aaacctggaa acaagaacca aaaaaagtgg gctttctctg catcatcatt 
69301 ccacttctgg tccccaactg aaaaggacaa tccaaagcta gatcccttca aattttcctt 
69361 tttgttttcg aaattttcgc aatttttaat attattttgg aagtctatgt ttctttctga 
69421 tctttagcaa caaaggaagg tggaatctgt ttcacgttta cacaaaaaca tgtcaactgg 
69481 agattttctc tttccctaac ttttgaccat acagtatggt ccatacttaa tattctctct 
69541 ttgtttttaa taaaataaaa ggtttggtta tcaagcatat atgtcattag cttaaagcta 
69601 tgactttgtt tagaaaactt aggaggacca tatggcaagc ttttatacag tgttagactt 
69661 ctaacgttaa ttctaaacaa tctccagtat caagcattaa caaggtttat tctagcacct 
697-21 ctggattttt aaaacttctc gaaccaatcc ttaactaaaa aagaaattca agcgttttat 
69781 ctttagaaat cacagctagc atatgctgag aattactctc catggaaact tatactaaga 
69841 ttgttttttt ccctcatatt taagccacta aagtcaaaag attagtacat tgacaactaa 
69901 gtttagatgc tctatgcgga gaatcaattt catatgaatg tatcaagcaa ttcatgaact 
69961 ctaggagacc ataaaatcca attgacagaa aaaatgagtc aactaacata tttacctgtg 
70021 atatgaggta catgtgcagg tcaaagatca gaagaaaatt ttctccatga gtctcttgag 
70081 cttccaactc atccagcgat ttgtatcaca aacaatctga aaaagaagct aaaaaacgtt 
70141 ataccaaagt ttcacgccca taatgctatt gtttggttct ttcaagaacc tccccaatct 
70201 tttgaattcg cattcaaaaa aaccatcagt gagtccattt caagtcggaa ctggcaggta 
70261 ttattcatta tgacaaagta catacacttg ccccccactg aacaatgtca agaagggaaa 
70321 acccgacatt gtgttggaat agctaaagtc tcatctcgtc tcgtgataca tgaaggttat 
70381 caatatcaac ttgtagcaac tgtaatttac ttctaatatc tgataattct ttctggattc 
70441 ctaaaagacg atcaagtctt agctgagctt cttctcgata aggcttggca acaatattca 
70501 caaagttaac tagattactc gtcgcatctg aaagatcttt ttgcatagcg tcttcgagct 
70561 gttgagccaa cgcatcagcc actttattca ccttaccaat tatagcctgt cttcgatatg 
70621 ggaagtttgc tatagccaca tacctgtcac atagattatg ttatgcatac aaccagtctt 
70681 tcttaaaagt cataaatatg cctctagttg caagaaaaaa atacactagg cgtgatctaa 
70741 gaaggtggag taatgagaca ttgggaagag gggaaattta gagcagtgtt attaccctcc 
70801 agcggagcaa aggccaagag caagaagatc ttccagtgtg gtcggtagca ctgaggttag 
70861 aagtgatgca gacagtcctg cagctccaag cccaccaact gtcacaaaga actagaaaag 
7 0921 gcaagcaaaa ctcaactatg tcaaaagtgt cacttagatt gattcttgaa tagcgagacg 
70981 aagtatctgg gaaaatacgg tactgaatta acatctccgt cagatcatag gttcggattg 
71041 aacagatgac acaattaaac aatgatgaag atcaagacac tttaatcgac tgaattc 




Fig. 24 



AtFzo-like Genomic Sequence 

From 

F15K9, AC005278: 
F10O3, AC006550: 



69001 aaaaactttt caaaacttca tgtgttgtga aaacaaaagt tttttggtaa tgaaaactcg 
69061 acaaagacca gttaaaaacg tgtgtagtat aacttactgg taagtaaagc tataagcaag 
69121 aatctgtacc ttattttctc tctctctagt gagccctgac catccgaatt tcgcattcgc 



69181 caatcgctgt gtttccgtgt gttttccccc tttttggttt tagatttgcc taaaccaatc 
69241 agaacaagag aaacctggaa acaagaacca aaaaaagtgg gctttctctg catcatcatt 
69301 ccacttctgg tccccaactg aaaaggacaa tccaaagcta gatcccttca aattttcctt 
69361 tttgttttcg aaattttcgc aatttttaat attattttgg aagtctatgt ttctttctga 
69421 tctttagcaa caaaggaagg tggaatctgt ttcacgttta_ cacaaaaaca tgtcaactgg 
694 81 agattttctc tttccctaac ttttgaccat acagtatggt ccatacttaa tattctctct 
69541 ttgtttttaa taaaataaaa ggtttggtta tcaagcatat atgtcattag cttaaagcta 
69601 tgactttgtt tagaaaactt aggaggacca tatggcaagc ttttatacag tgttagactt 
69661 ctaacgttaa ttctaaacaa tctccagtat caagcattaa caaggtttat tctagcacct 
69721 ctggattttt aaaacttctc gaaccaatcc ttaactaaaa aagaaattca agcgttttat 
69781 ctttagaaat cacagctagc atatgctgag aattactctc catggaaact tatactaaga 
69841 ttgttttttt ccctcatatt taagccacta aagtcaaaag attagtacat tgacaactaa 
69901 gtttagatgc tctatgcgga gaatcaattt catatgaatg tatcaagcaa ttcatgaact 



69961 ctaggagacc ataaaatcca attgacagaa aaaatgagtc aactaacata tttacctgtg 
70021 atatgaggta catgtgcagg tcaaagatca gaagaaaatt ttctccatga gtctcttgag 
70081 cttccaactc atccagcgat ttgtatcaca aacaatctga aaaagaagct aaaaaacgtt 
70141 ataccaaagt ttcacgccca taatgctatt gtttggttct ttcaagaacc tccccaatct 
70201 tttgaattcg cattcaaaaa aaccatcagt gagtccattt caagtcggaa ctggcaggta 
70261 ttattcatta tgacaaagta catacacttg ccccccactg aacaatgtca agaagggaaa 
70321 acccgacatt gtgttggaat agctaaagtc tcatctcgtc tcgtgataca tgaaggttat 



70381 caatatcaac ttgtagcaac tgtaatttac ttctaatatc tgataattct ttctggattc 
704 41 ctaaaagacg atcaagtctt agctgagctt cttctcgata aggcttggca acaatattca 
70501 caaagttaac tagattactc gtcgcatctg aaagatcttt ttgcatagcg tcttcgagct 
70561 gttgagccaa cgcatcagcc actttattca ccttaccaat tatagcctgt cttcgatatg 
70621 ggaagtttgc tatagccaca tacctgtcac atagattatg ttatgcatac aaccagtctt 
70681 tcttaaaagt cataaatatg cctctagttg caagaaaaaa atacactagg cgtgatctaa 
70741 gaaggtggag taatgagaca ttgggaagag gggaaattta gagcagtgtt attaccctcc 
70801 agcggagcaa aggccaagag caagaagatc ttccagtgtg gtcggtagca ctgaggttag 
70861 aagtgatgca gacagtcctg cagctccaag cccaccaact gtcacaaaga actagaaaag 
70921 gcaagcaaaa ctcaactatg tcaaaagtgt cacttagatt gattcttgaa tagcgagacg 



70981 aagtatctgg gaaaatacgg tactgaatta acatctccgt cagatcatag gttcggattg 

71041 aacagatgac acaattaaac aatgatgaag atcaagacac tttaatcgac tgaattc 

tagttagaac ttagactaaa agtatttaat acttgaagct 
241 caccacttct cgaatatctt gttccaatcg ttttgatgtg gttccggcac tcaagttctg 
301 tattgttttc aagctgactt tatcagtttt ctgaagtaag tcatatgtgt ctatgcccaa 
361 ttgcgttttt gaattgacat atgttggcca tttgttttcg aatgatttca gagacagact 

421 cccttcacgg gcagtatttg attgtagcca ttcagcatat tttccaagca attcctgcaa 
481 acagtgaaat gtaaagtcaa tcaggtcaca acaagacatt gttagacaat atttactttc 
541 tgcatgaata gtgactatat ctcagacctc atatatatga ccaacatgtc cccagttagg 
601 ccaatactca aagaataaag catcacactt actttcgcat ttgtgagtgc tggagcgagt 
661 atttcacctt gaactttgga tgtagctgct actgaggccg atttttcccc tttgaacacg 
721 tacgagatcg caagatcaag gcttgatagt cgcagggtag ttcctatcag atcaacaact 
781 tgtaatctgg cattatcaat ctgcgataaa agccagagta caaaacacaa aaaagccaag 

841 ttagaacaat ccaatttcct ccttcgtgat tcaacaagat aatatctaat agaatttata 
901 ccaacgagag agcctgcctt ctccaagaaa tgctctcata ttccatctta agcgcgtatt 
961 ctttagttcg actgataatc ttgtctgctg aagccaagtc ttccctagca gctaggcaat 
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1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 

1681 
1741 
1801 

1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
4201 
4261 
4321 
4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 



cttgtctcac 
ccaatttaag 
gaaatttctc 
caagatcatc 
cggacaccgg 
caaatgaaat 
agcaattaca 
catcacgata 
tgtaccggag 
caagacagca 
aagcttattc 



aagagcttcc 
ccttattctc 
aagttcattg 
tctgccaacc 
atacaatatc 
agcttcctca 
gagactatca 
gatatcagat 
aaacgcaacc 
ggaagctcca 
cttcttttga 



acagaagaaa 
tccatcccgg 
aagctctgga 
aaagaagctg 
acattttctg 
agctgcagca 
aggaaaagac 
ttattcagaa 
tgacaaacgc 
gaggtagaga 
aactttttca 



ggagacgctc 
tagctgttga 
ctctccaatt 
ttgaaagctt 
tattaagcaa 
gtaaaattaa 
ataaatattg 
taaacacaaa 
aatgagatgt 
tagaaatgac 
gattttcaaa 



agcaatcgcc 
gctatccaag 
agaaccagga 
cgcctcaaga 
cttccgtgta 
tctttagtca 
tttctgataa 
tttctttttc 
aaatcagcct 
atgggtatga 
ataagaacca 



atgggtgtct 
aagctataaa 
tctgcgatct 
gcagaccgtg 
ttctctttaa 
agaaaaatcc 
acctcacgag 
cactgctgtg 
acagtaaaat 
tgacattgag 
aatattcata 



aggttagtac ttcgtagatg atcatttaca tacaatttgc catggacgta aaatcagttt 
tgctgtacca ataacaactc acctctcaaa tgtattggcc taagctttac caaaatatgg 
accaacagga catactggtt ccagcaatag ctataacact cattactttg aaaataaatt 
1861 agtctagttt aaatttataa tcaacaaaac cttgatatat gcattcaacc tagtgacaac 
tttcccttcc ctacacaagg tcttgatggt ctatacatga cttaactaac tactttcgtg 
aaaaatagat ctaacaagct acttaggcag atacatattg agcaaaaaca acaacaacta 
acatgccaag taaaacggta acttctacct cactttcagt taaagggcgg tcagcagaaa 
gaacaaaaac aagcaaatct gcacgtggaa caaattcttc tgtaagacgc tgttgccttt 
gaaggatcac attggtccca ggtgtgtcaa caatatttat ctgctcgaaa ggagaacaac 
ccaaatcaat tcctccaaat aattagtaaa aattcaggac tgcgatggta gaattttgaa 
ttactcacat ccttaagtat tggtgcagga agatagcata catattggcc atctggatgt 
gtttggcaac gttgttgctc ttcggattcc aagtcagagt agcacagaaa cgtgatttca 
ttggtagtgg ggactacccc ttctttcagg tatctcttcc caagaagtgc attgataacc 
gttgattttc cagagttaaa ttccccctag taccagcaaa ttgtggattc caattaatta 
tccatacaaa aactattttg aatcataaag gcaacttgtc ccacttaaaa catcttccaa 
gaacggcata ctcacaagtc acaagaaagc gaaaaaacta tgtacacaag cagatggtac 
agttaagaaa tatgcagttc ccttacaacc agatccataa gcaaattaag aagagaagca 
tagacagaac aaaagaagaa gcgaaacatc atagtcatag gcgacgaaag aagatattga 
gaagcactaa aggacaagtg actagaagca tgcccattgt atctcaagct atctagtgct 
ggcaaaagtg tgtaactaaa atgtttctgg aatgaatata gagctaatac caatgcaaat 
tcctggaatc acaatttgac ggagttgagt gcagaattac cactataacc atcagaaacg 
gctcatcgat ccgagaaaca gcatcaatta gaagggagac ttcctccatc tgttaatgaa 
aaatagtcga agaagtaaat gaaaccatat tataaccaca caaaaaccta cttagaacca 
tttcaaaaaa ttgacttcaa ctaccaaatg tttgaaataa aaatcaccag tggagccgcc 
ttgtggatga tttcaatcgt ctctctcaac acagatttct ccatttctac tattagtttc 
tgtttgtcct ctaattttat gaagccagca gaattctttt tctccggcag ttcattcata 
ttttgtgtct catgattatt tacaacataa gctccatcaa gactctggcg aagagctaca 
tccctagaag aacgcaaatc tttcaacgaa ataacaaaac cagaaacacc tgatttcagt 
aactgcaatt cttctttagc ttcttcattg cctctgcaag tcacatatat cggtattttc 
acgctcttca acaaagaatc cgccacttgc gtatcttctt caccagatcc aagtataagg 
aaatcagcac cctcggagct tgaggcaatt agagcagaat caacatcctt cacaatccga " 
gctaccagtg gaagaagtac cgagtcgggg ttggatccca tcaatgtgtt tctcgccaca 
atcgccggaa gacctgtaca tagcattgag agacaaaact cactctatga gaaattaaat 
gctaaaactg aatgaaatca gttgttacct tcgtcggaga gagcaacacc actagcacca 
acggcggagg cgatatcaac acgttcagcg atcaagaggt aagcacggcc tttgacaagt 
gatttcagca aacaagccgc ctcgtagagc ttaccagcgg tggctccgcc atcaatcacg 
acgatttgaa ccgatttagc taaagcacgg tcgaccaaat caagagtctc ttcacgattc 
ccgctcataa cctcgtcggc gtctagccgg agaagtaaac cgggaacggc gagttcggga 
cgcttgtaac caccaggata aagagttcgc ggcctagaag aagaagtctg atcggcggat 
tcatgcgaaa tgtttctgat cgagagagaa gaaaaacgcc tatgacgtgg aggagtaaag 
gaggataact taaagcaccg gccaggaaac ggtggagatg cggcggagat aagaaacggt 
gacgtcacac attgccggtg agagattaga gttctcatgg agagagaaag agagagagtg 
agataccttc gtctgtgttt ctgattagtt ttcttatctc ttaaatatcc tattggccca 
tcccaaagct ataagacgga cggcgcgatt aattactttt caaacacatg aattaacgtt 
ttcacatatg tgttcatatc caaaaggtcc aaagtatacc acgaaaaggg agaaaaacag 
atttaaattc gtgaaatccc tctcccacaa ttaaatttac ttcttccaaa caaagacaaa 
cggcttgaac cagtcaagta agtgatacgg caccactaga tgttccagag cctccatctt 
ttttaatacg aagaagattt gtcctttgtg tgtatgaatt taacaagttt taattataga 
tttgtgtgtg tatgaattta aaaacctagt acgtagcatc agggaatgat atcatagcta 
ttttagttga gctttcaaat aagagatgat caaaatttag aacttctaag aacatgaacg 
aataaacaac tattttcttt tcaaaccaac taaggtagat ggtcactgaa agtatataca 
tcagataaaa gttgcttgtt attccagatg aagttggacc gagaaaaaaa aaagttactt 
gttattcaat atgtttggat ctttgtcttg cagattgcta tatagggttg ataatgggct 
tcgttgtaat gggtatacag tgtataagaa tcggccttgt gcaaccaatc ctaatatgtg 
tgtctcatta aggtaagtgc ttaagattag aagagtaaaa cacttgactt atcaactatg 
tcaactaagg gttctatatt tttattaaat aaaaaataat tgaatatttt ttagaatgat 
ttaataaatt taatgctatt gtttgattta aatgtataat tcaccgcgag aagaaatttt 



Fig. 24 continued 3/3 



5101 ataactcaaa ttttaaagtt ttaagttgta tttgtttatt ttgttaaatg tttaatattg 

5161 tataattgta ttttgattgt tgtttctcgg atttcacccg tagtacatca tcccatatta 

5221 atatcgaatc aaacccgtca attctaaaat ttcacccgtg gtagtattta attgtataat 

5281 tatattttaa ttgtcattct aagatttcac tcctaattct atcgcaaatt attatcaacc 

5341 caaaccagtc aattctaaaa tatcacccgt agtacaccat cccatattaa tatcgaatca 

5401 agcccgtcaa ttctaggatt tcacccgtgg tagtatttaa ttgtataatt atattttaat 

5461 tgtcattcta ggatttcact cctaattcta tcgcaaatta ttatcaaccc aaaccagtca 

5521 attctaaaat atcacccgta gtacaccatc ccatattaat atcgattcaa actcgtcaat 

5581 tctaggattt cgctcgtggt agtatttaat tgtataatta tattttaatt gtcattttaa 

5641 ctcctagttc tatcgcaaat tcttatcaac ccaaacagtc aattctaaaa tttcacccgt 

5701 agtataaagt ttaaatattt ataatattta aatttcttat aaaagaatca aaatgtgttt 

5761 taaaaaaatt aaagttttaa gttttttttt tttaatattg ttaattttgt ttagtgttta 

5821 agattatata attacattat gattgtcatt atatgttttt ctccatagca tactatccca 

5881 tgttattatC CACTCAAACC TGTCACACCA TATAACcccg tcccgtgaaa ttaaacacaa 

5941 atttgtcatt ttattataaa tttcaaatat ttataaaatt agaaacttca aaaaagatta 

6001 atattgaccc aaacttcatc attgaatttt gagtgttata tctaagattt ctctcgcaat 

6061 atatcgtccc gtattaatat cttttatatt gtttaaattt cttgtaaaat ttaatttata 

6121 attttttaaa ctttttaaag tttcaatttt tt'aaaataaa taaccctagg aaacaaacca 

6181 ttttaattta aagataaact ttataaaaag tttttaaaat tataatattt aacttttgat 

6241 aaagttataa tatttataat ttcttgaaac attttaaagt ttcaattctt taaaataata 

6301 aatccgagta aaatcagata actattttaa ttttggacgc ttgataaatc aagcttcctg 

6361 ctcattcgta atcagaatca ttttggtcct tttataatat gggtctgaac cattgtccaa 

6421 tttttctaag cgatgtggga cattgtacac atattatttc ttcataggtt gaataatata 

6481 tgtccgttta aaaaactttg aattacatca tattcagaaa aaaatataat attttattaa 

6541 ctatatatat tttatataaa ttcaaaataa ataaagtata agatcaaata aaaatgaaag 
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ARCS Homologous Sequences 



<A class=dblinks 

href^"javascript:PopUpMenu2^Set(Menu22246438,",",",",");" 

target=_seIf>Links<yA></SPAN></TD></TR><yTBODY><^ 
OCUS BQ860973 

712 bp mRNA linear EST 14-AUG-2002 
. DEFINITION QGC 1 7C24.yg.ab I QG^^ABCDI lettuce salinas Lactuca sativa cDNA clone 

QGC17C24, mRNA sequence. 
ACCESSION BQ860973 
VERSION BQ860973.1 GI:22246438 
KEYWORDS EST. 
SOURCE Lactuca sativa 
ORGANISM <A href="http://www.ncbi.nlm.nih.gov/htbin- 
post/Taxononiy/wgetorg?name=Lactuca+sativa 
">Lactuca sativa</A> 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; 

asterids; campanulids; Asterales; Asteraceae; Cichorioideae; 

Cichorieae; Lactuca. 
REFERENCE 1 (bases 1 to 712) 
AUTHORS Kozik,A., Michelmore,R.W., Kiiapp,S, Matvienko,M., Rieseberg,L., 

Lin,H., van Damme,M., Lavelle,D., Chevalier,?., Ziegle,J., Ellison 

,P., Kolkman,!., Slabaugh,M.S., Livingston,K., Zhou,Y., Lai,Z., 

Church,S., Jackson,L. and Bradford,K. 
TITLE Lettuce and Sunflower ESTs from the Compositae Genome Project 

<A 

href-'http://compgenomics,ucdavis.edu/">http://compgenomics.ucdavis.edu/</A>/ 

JOURNAL Unpublished 
COMMENT Contact: Alexander Kozik [R. W.Michelmore] 

Department of Vegetable Crops, R. W.Michelmore Lab 

University of California at Davis (UCD) 

Asmundson Hall, UCD, Davis, CA 95616, USA 

Tel: l-(530)-742-1742 

Fax: l-(530)-752-9659 

Email: <A href="mailto:akozik@atgc.org">akozik@atgc,org</A> <A href=" 
mailto:[michelmore@vegmail.ucdavis.edu">[michelmore@vegmail.ucdavis.edu</A>] 

singleton, see <A href="http://cgpdb.ucdavis.edu/">http://cgpdb.ucdavis.edu/<A>/ 

for 

details. 

Plate:QGC17 row: C column: 24. 
FEATURES Location/Qualifiers 



Fig. 26 continued 2/9 



source 1..712 

/organism="Lactuca sativa" 
/mol_type="mRNA" 
/cultivar="Salinas" 
/db_xref="<A href==" 
http://www.ncbi.nlm.nih.gov/htbin-post/Taxonomy/wgetorg?id=M236">taxon:4^ 
/clone="QGC17C24" 
/lab^host="E.coli" 

/clone Jib="QG_ABCDI lettuce salinas" 

/note="Vector: pBRcDNASfiAB; The library was constructed 

from 10 different sources of RNA from a single genotype. 

Separate cDNAs were generated using primers that 

incorporated unique 5' and 3' tags to distinguish each 

source of RNA. cDNAs were then pooled, size-fractionated, 

directionally cloned into a custom medium-copy vector and 

transformations made with four size classes to minimize 

size bias. Details of each source of RNA and library 

construction can be obtained at <A href="http://cgpdb.ucdavis.edu/"> 

http://cgpdb.ucdavis.edu/</A>/ 

TAG_LIB=QG_ABCDI lettuce salinas 
TAG_TISSUE=chemical induction 
TAG_SEQ=TGTAGCCGGG" 

BASE COUNT 206 a 152 c 142 g 210 1 2 others 

ORIGIN 

1 ttgttcagct ccgccaaaag aatccaagaa ttggcgtaat ccggctcgat tcttattgtg 
61 aagggaccag gtgacataac gggtggtgct tattagatct tccatgcatt tttcatggca 
121 tgatctttcg gtggattcag caaagttata gaaagcagat gaaacacgtc tcaagaaaac 
181 ttcatggcca cttaggaatt cgccttcttt ctgaagaaga taaacggaga tgggaagtaa 
241 tctcttgaga atgtgaagaa gtcgactgcc caactgatga agaaaaggtt caaaagtatc 
301 acgagctttt gcaacagcga tgacacatgc agtcctggag taatttgttc catcatgaat 
361 atcttcgacc ccacatgcat tcacaatttc ttcacgtgta attgcagggc attttatccc 
421 tccaacaaca aacctaaatt cagccatggc acgatgatat tgtgcacctc catatagacg 
481 catacctgca ttaggtatta gtttgtgtgg gaactgagag ccatcaatac cgattaatgc 
541 ccctccatta accctctcat cttgtagtgt ttccccaaat ttatctggag gtgcaacaac 
601 tgtccctntt catagcagtg ataacttggt aaggaaaaga tcatgaaaag atctcncttt 
661 ctcctttagt ttgacttcat ctaaagtgct gagttcttga tttatgtcat tt 

•// 

</PRE> 
<DL> 
<DT> 

<TABLE cellSpacing=0 cellPadding=0 width="100%"> 
<TBODY> 
<TR> 

<TD><INPUT type=checkbox value=13371 119 name=uidxB>2: </B>BG452325. 
NF086D06LF1F1047 .,.[gi:13371119] </TD> 
<TD align=right><SPAN> 
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<SCRPT language=JavaScriptl.2> 
varMenul3371119 = [ 

['Taxonomy'VVindow.topJocation=7entrez/query.fcgi?db=nucleotide&cmd=Dispiay&do 
nucleotide_taxonomy&from_uid=13371 1 19'","",""], 

["Help","window.open(7entrez/query/static/popup.html\'Links_HelpVresizable==^ 

==yes,toolbar==no,location==Tio,directories===no,status==Tio,menubar^ 

0,height=500');","",""] 

] 

/A-> 

</SCRIPT> ^ 

<A class=dblinks 

href="javascript:PopUpMenu2_Set(Menul3371 119,",",",",");" 

target=_self>Links</A></SPAN></TD></TR></TBODY></TABLE></DT></DI^^ 
OCUS BG452325 

666 bp mRNA linear EST 16-MAR-2001 

DEFINITION NF086D06LF1F1047 Developing leaf Medicago truncatula cDNA clone 

NF086D06LF 5', mRNA sequence. 
ACCESSION BG452325 
VERSION BG452325.1 01:13371119 
KEYWORDS EST. 

SOURCE Medicago truncatula (barrel medic) 
ORGANISM <Ahref=" 
http://www,ncbi.nlm.nih.gov/htbin- 

post/Taxonomy/wgetorg?name=Medicago+truncatula">Medicago 
truncatula</A> 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids 

; eurosids I; Fabales; Fabaceae; Papilionoideae; Trifolieae; 

Medicago. 
REFERENCE 1 (bases 1 to 666) 
AUTHORS Torres-Jerez,I., Scott,A.D., Harris,A.R., Gonzales,R.A,, Bell,C.J., 

Flores,H.R., Inman,J.T., WellerJ.W. and May,G.D. 
TITLE Expressed Sequence Tags from the Samuel Roberts Noble Foundation 

Medicago truncatula leaf library 
JOURNAL Unpublished 
COMMENT Contact: May GD 

Plant Biology Division 

The Samuel Roberts Noble Foundation 

2510 Sam Noble Parkway, Ardmore, OK 73402, USA 

Tel: 580 221 7391 

Fax: 580 221 7380 

Email: <A href="mailto:gdmay@noble.org">gdmay@noble.org</A> 

/5f 



Fig. 26 continued 4/9 



Insert Length: 666 Std Error: 0.00 
Plate: 086 row: D column: 06 
Seq primer: TCACACAGGAAACAGCTATGAC. 
FEATURES Location/Qualifiers 
source 1..666 

/organism="Medicago truncatula" 
/mol_type="mRNA" 
/db_xref="<A href=" 

http://www,ncbi.nlm.nih.gov/htbin-post/Taxonomy/wgetorg?id=3880">taxon:3880</A>" 
/clone="NF086D06LF" 
/tissue_type="leaf' 
/dev_stage="Pooled developmental" 

/clone_lib="Developing leaf ' 

/note="Vector: Lambda Zap; Contains a mixture of very 

young, developing, mature and senescing leaves." 
BASE COUNT 201a 163 c 147 g 154 1 1 others 
ORIGIN 

1 atctaaagta acaaccacca caaaacacaa caatggagga agaaagagaa caccaccaac 

61 tcaaagacaa agaagaaaac gagtggcgtc tctacgaagc ttacaatgaa cttcacgcgc 
121 ttgctcaaga acttcacacg cctttcgacg cgccggcggt actggttgtg ggccaccaaa 
181 cagacgggaa gagcgcctta gttgaggctc taatgggctt ccagttcaac cacgtcggtg 
241 gtggcaccaa aacccgccgg cccattactc ttcacatgaa atatggccca cattgcgagt 
301 ctccttcttg ctatcttctt tctgatgatg acccttctct ttctcaccat atgtcacttt 
361 cccaaatcca gggttatatt gaagctgaga atgcgaggtt ggagcgtgac tcatgttgtc 
421 aattttcagc taaggaaata atcataaaag tggaatacaa atactgtccc aatctcacca 
481 taatagacac accaggatta gttgctcctg caccaggtcg taaaaatagg gcgatacagg 
541 cacaggcacg agcggtagag tcactcgttc gtgcaaaaat gcagcacaag gagttcatta 
601 tactctgtct tgaagattgt agtgattgga gcaatgcgac tacgangcgc gttgtaatgc 
661 aaattg 

// 

</PRE> 
<DL> 
<DT> 

<TABLE cellSpacing=0 cellPadding=0 width='400%"> 
<TBODY> 

<TR> 

<TD><INPUT type=checkbox value=14878353 name=uid><B>3: </B>BI270606. 

NF056G04FL1F1036 ...[gi:14878353] <n^D> 
<TD align=Tight><SPAN> 

<SCRIPT language=JavaScriptL2> 

<!- 

varMenul4878353 = [ 

["Taxonomy" /Vindow.top.location='/entre2yquery.fcgi?db===nucleotide&cmd=Display&dopt= 
nucleotide_taxonomy(&from__uid=14878353"',"",""], 

/5f 
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["HeIp'^'Vindow.open(Ventrez/query/static/popup.htmlVLinks_Help^Vesizable=no,sc^^ 

==yes,toolbar==noJocation=no4irectories=no,status=no,menubar=no,copyhistory^ 

0,height=500');","",""] 

] 

//"> 

</SCRIPT> 

<A class=dblinks 

href="javascript:PopUpMenu2_Set(Menul4878353,";',",",");" 

target=_self>Links</A></SPAN></TD><TR></TBODY><^ 
OCUS BI270606 

663 bp mRNA linear EST 18-JUL-2001 

DEFINITION NF056G04FL1F1036 Developing flower Medicago truncatula cDNA clone 

NF056G04FL 5', mRNA sequence. 
ACCESSION BI270606 
VERSION BI270606.1 01:14878353 
KEYWORDS EST, 

SOURCE Medicago truncatula (barrel medic) 

ORGANISM <A href=" 
http://www.ncbi.nlm.nih.gov/htbin- 

post/Taxonomy/wgetorg?name=Medicago+truncatula">Medicago 
truncatula</A> 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta; eudi cotyledons; core eudicots; rosids 
; eurosids I; Fabales; Fabaceae; Papilionoideae; Trifolieae; 
Medicago. 
REFERENCE 1 (bases I to 663) 

AUTHORS Torres-Jerez,I., Scott,A.D., Harris,A.R., Gonzales,R.A., Bell,C.J., 

Flores,H.R., InmanJ.T., WellerJ.W. and May,G.D. 
TITLE Expressed Sequence Tags from the Samuel Roberts Noble Foundation 

Medicago truncatula flower library 
JOURNAL Unpublished 
COMMENT Contact: May GD 

Plant Biology Division 

The Samuel Roberts Noble Foundation 

2510 Sam Noble Parkway, Ardmore, OK 73402, USA 

Tel: 580 221 7391 

Fax: 580 221 7380 

Email: <A href="mailto:gdmay@noble.org">gdmay@noble.org</A> 
Insert Length: 663 Std Error: 0.00 
Plate: 056 row: G column: 04 
Seq primer: TCACACAGGAAACAGCTATGAC. 
FEATURES Location/Qualifiers 
source 1..663 

/organism="Medicago truncatula" 
/mol_type="mRNA" 



Fig. 26 continued 6/9 



/db_xref="<A href=" 

http://www.ncbi.nlm.nih.gov/htbin-post/Taxonomy/wgetorg?id=3880'>taxon:3880</A>' 
/clone="NF056G04FL" 

/tissue_type="Developing flowers" 

/dev_stage="Developmentally pooled. Contains a mixture of 
very young, developing, fully-opened flowers and flowers 
in early transition into pods." 
/clone_lib- 'Developing flower" 

/note="Vector: Lambda Zap; cDNA was prepared from polyA+ 
enriched, pooled samples of equivalent amounts of total 
RNA from very young, developing, fully-opened flowers and 
. flowers transitioning into pods. The cDNA was 
directionally ligated into the Uni-Zap XR vector 
(Stratagene) and packaged using the Gigapack HI Gold 
packaging extracts. Phagemids containing cDNA inserts were 
in vivo excised from the recombinant Uni-ZAP XR vector 
using ExAssist helper phage and the E, coli strain 
XLl-Blue MRF' (Stratagene). Excised plasmids were plated 
using SOLR cells." 

BASE COUNT 191 a 141 c 144 g 187 t 

ORIGIN 

1 gtctttatgg gggtgcacaa tatcatcgag caatggctga atttcgtttt gtagttggag 
61 gaatcaagtg ccctccaatt acccgggaag aaattgtaaa tgcttgtgga gttgaagaca 
121 ttcatgatgg aacaaactac tctaggactg cttgtgtaat tgctgttgca aaggctcatg 
181 atacatttga accttttctt catcagttgg ggtctagatt gttgcacata cttaagagat 
241 tgctcccaat ctctttttat cttcttcaga aagattgtga gtatctaagt ggccatcagg 
301 tgttcctcag gcgtgttgcc tccgccttcg acaactttgc agaatccact gaaaaatcat 
361 gccgtgaaaa atgtatggag gacttggtaa gcaccacacg atatgtctca tggtctctac 
421 acaataagag tcgggcagga ttacgccagt tcttagattc atttggtgga acagaacatt 
481 ccaatgtttg taatgatccc actgcaactg ttctatcaca aacaaatgtg caagagaagg 
541 aagacacaaa gccacaacta gaagtaaagc tcagtcacgt ggcctctgga actgatccta 
601 gcacatccac ccagacagct gaaacaaagc ttgctgacct tcttgatagt acactttgga 
661 ate 

// 

</PRE> 

<DL> 
<DT> 

<TABLE cellSpacing=0 cellPadding=0 width="100%"> 
<TBODY> 
<TR> 

<TDxINPUT type=checkbox value=22485477 name=uidxB>4: </B>BU045400. 

PP_LEa0022H05 f Pe. . . [gi:22485477] </TD> 
<TD align=right><SPAN> 

<SCRIPT language=JavaScriptl.2> 

<!-- 

varMenu22485477 = [ 
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['Taxonomy'V'window.top.location=Ventrez/query.fcgi?db=mucleotide(&cmd=Display 
nucleotide_taxonoiny&from_uid=22485477"',"",""], 

[''Help'V'window,open(yentrez/query/static/popup.htmlVLinks_HelpVresizable==no,^ 

==7es,toolbar==no,location==Tio,directories==Tio,status==no,menubar^ 

0,height=500');","",""] 

] 

//--> 

</SCRIPT> 

<A class=dblinks 

href=••javascript:PopUpMenu2_Set(Menu22485477;^^^^'0^ 

target=_self>Links</A></SPAN></TD></TR></TBODY></TABLE></DT>^ 
OCUS BU045400 

622 bp mRNA linear EST 26-AUG-2002 

DEFINITION PP_LEa0022H05f Peach developing fruit mesocarp Prunus persica cDNA 

clone PP_LEa0022H05f, mRNA sequence. 
ACCESSION BU045400 
VERSION BU045400.1 GI:22485477 
KEYWORDS EST. 
SOURCE Prunus persica (peach) 
ORGANISM <A href="http://www.ncbi.nlm.nih.gov/htbin- 
post/Taxonomy/wgetorg?name=Prunus+persica 
">Prunus persica</A> 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids 
; eurosids I; Rosales; Rosaceae; Amygdaloideae; Prunus. 
REFERENCE 1 (bases 1 to 622) 
AUTHORS Callahan,A., Palmer,M., Main,D., Wing,R. and Abbott,A. 
TITLE Peach Model Genome for Rosaceae 
JOURNAL Unpublished 
COMMENT Contact: Abbott, A. 

Dept of Genetics and Biochemistry 
Clemson University 

122 Long Hall, Clemson University, Clemson, SC 29634, USA 
Tel: 864 656 3060 
Fax: 864 656 6879 

Email: <A href-'mailto:aalbert@clemson.edu">aalbert@clemson.edu</A> 
Total High Quality bases = 553 
Seq primer: TAATACGACTCACTATAGGG 
High quality sequence stop: 622. 
FEATURES Location/Qualifiers 
source 1..622 

/organism="Prunus persica" 
/mol_type="mRNA" 



Fig. 26 continued 8/9 



/cultivar="Loring" 
/db__xref="<A href=" 
http://www.ncbi.nlm.nih.gov/htbin-post/Taxonomy/wgetorg?id=3760'>taxon:3^ 
/clone='TP_LEa0022H05f' 
/tissue_type="Mesocarp" 
/lab_host="E. coli" 

/clone_lib="Peach developing fruit mesocarp" 
/note=" Vector: pBluescript H SK(-); Site__l: EcoRJ; 
Site_2: Xhol; authority=Prunus persica L. Batsh; The 
sequence has been trimmed to remove vector sequence and 
contains a minimum of 100 bases of phred value 20 or 
above. For more details on library preparation and 
sequence analysis go to 

<A href="http://www. genome.clemson.edu/projects/peach"> 
http://www.genome.clemson.edu/proj ects/peach</A>. To order 

this clone go to <A href="http://www.genome.clemson.edu/orders"> 
http://www.genome.clemson.edu/orders</A>" 
BASE COUNT 168 a 125 c 147 g 181 1 1 others 
ORIGIN 

1 gcttatacct aacgcaggaa tgcgtttata tggtggtgca caataccacc gtgccatggc 
61 tgagttccgc tttgtagttg gaggaataaa atgccctcca attacaaggg aagaaattgt 
121 aaatgcatgt ggagttgaag atttacatga tggcacaaac tactcaagga cagcttgtgt 
181 aatagccgtt gcaaaggccc gtgatacatt tgagcctttc cttcatcagt taggttgtag 
241 actcttgcac attctaaaga gattacttcc tatatcagtc tatcttcttc agaaagatgg 
301 tgagtattta agtggccatg aggtgtttct taggcgtgtt gcttctgctt tcaatgactt 
361 tgcagaatct accgaaaggg catgtcgtga aaaatgcatg gaggatttag taagcaccac 
421 ccgctatgtc acctggtccc ttcacaacaa gaatcgagct gggttacgtc aatttttaga 
481 ctcgttcgct ggaacagaac ataacactat gggtagtaat tgcgtacctg ctggtatttc 
541 ccaagattca tcctttgggt ctgttgccaa tgagaaggat actaagtcaa gggcagatgt 
601 gaagctcanc catgtggcgt ct 

// 

</PRE> 

<TABLE cellSpacing=0 cellPadding=0 width="l00%" bgColor=#cccccc> 
<TBODY> 
<TR> 

<TD> 

<TABLE cellSpacing=0 cellPadding=0> 
<TBODY> 
<TR> 

<TD noWrapxINPUT onclick-"GoV (")" type=button value=Display name=""> 
 <SMALL><SELECT onchange=form.view.selectedIndex=selectedIndex 
name=viewl><OPTION value=DocSum>Summary</OPTION> <OPTION 
value=asn>ASN.l</OPTION> <OPTION value=est>EST</OPTION> <OPTION 
value=fasta>FASTA</OPTION> <OPTION value=fasta_xml>TinySeq 
XML</OPTION> <OPTION value=gb selected>GenBank</OPTION> <OPTION 
value=gb_xml>GBSeq XML</OPTION> <OPTION value=gi>GI List</OPTION> 
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<OPTION value=graph>Graphics</OPTION> <OPTION 
value=xml>XML<OPTION> <OPTION 

value=def>default<OPTION></SELECT></SMALI><SMALL> 

  Show: </SMALL><SELECT 

onchange=form.dispinax.selectedIn(iex=selectedIndex 
name=dispmaxl><OPTION value=l>l</OPTION> <OPTION 
value=2>2</OPTION> <OPTION value=5>5</OPTION> <OPTION 
value=10>10</OPTION> <OPTION value=20 selected>20</OPTION> <OPTION 
value=50>50</OPTION> <OPTION value=100>100</OPTION> <OPTION 
value=200>200</OPTION> <OPTION value=500>500</OPTION></SELECT> 

  <INPUT onclick="GoV 
(fonn.SendTo.options[form.SendTo.selectedIndex].value,4)" type=button vaIue="Send to" 
name=""> ' - - 

 <SELECT onchange=form.SendTo.selectedIndex=selectedIndex 

name=SendTol><OPTION value=on selected>File</OPTION> <OPTION 
value=t>Text</OPTION> <OPTION 

.value="Add to Clipboard">Clipboard</OPTIONx/SELECT> 
<yTD></TR></TBODY></TABLE></TD></TR> 
<TR> 
<TD> 

<TABLE cellSpacing=0 cellPadding=0 width="100%"> 
<TBODY> 
<TR> 

<TD align=middle width="50%"> 

<DrV class=medium2>Items 1-4 of 4</DIV></TD> 
<TD align=right width="100%">One page.</TDx/TR><INPUT type=hidden 
value=20 name=showndispmaxxINPUT type=hidden value=0 
name==page><yTBODY><TABLE><TD></TR></TBODY></TABLE></FORM 
<DIV class=mediuml align=center> 

<P><A hxef^''http://www.ncbi.nlni.nih.gov/About/disclaimer,htmr*>DiscIaimer</A> | 
<A liref^"mailto:info@ncbi,nlm.nih.gov">Write to the Help Desk</A><BR><A 
href="http://www.ncbi,nlm.nih.gov/''>NCBI</A> | <A 

href^''ht^://www,nlm.nih.gov/">NLM</A> | <A href="http://www.nih.gov/">NIH</A> 
</P> 

<P> </P></DIV> 

<P class=dblinks align=right><FONT color=#eeeeee size=-5>Jun 19 2003 12:37:45 <!-- 
ipubmed? 

-></FONT></P> 

<SCRIPT language=JAVASCRIPT> /* <!- */ TextFocus (); // -> 
<yBODY></HTML> 



/(>0 



FIG. 27 



Fzo-like Homologous Sequences 



1: BG890612. EST516463 cSTD Sa..[gi: 14267734] 

LOCUS BG890612 752 bp mRNA linear EST 07-MAR-2003 

DEFINITION EST5 16463 cSTD Solanum tuberosum cDNA clone cSTD19A23 5* sequence, 

mRNA sequence. 
ACCESSION BG890612 
VERSION -BG890612.1 01:14267734 
KEYWORDS EST. 
SOURCE Solanum tuberosum (potato) 
ORGANISM Solanum tuberosum 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; 

asterids; lamiids; Solanales; Solanaceae; Solanum. 
REFERENCE 1 (bases 1 to 752) 
AUTHORS van der Hoeven,R., BezzeridesJ., Ewing,E., Cho,J., Chiemingo,A., 

Bougri,0., Buell,C.R., Ronning,C., Tanksley,S. and Baker,B. 
TITLE Generations of ESTs from dormant potato tubers 
JOURNAL Unpublished 
COMMENT Contact: Robin Buell 

The Institute for Genomic Research 

9712 Medical Center Dr, Rockville, MD 20850, USA 

Email: potato-array@tigr,org 

This clone can be obtained from the University of Arizona Genomics 
Institute. Orders can be made through URL: 
http://genome.arizona.edu/orders/ 
Seq primer: M13F-R. 
FEATURES Location/Qualifiers 
source 1..752 

/organism="Solanum tuberosum" 

/mol_type="mRNA" 

/cultivar="Kennebec" 

/db_xref="taxon:4113" 

/clone="cSTD19A23" 

/tissue_type="dormant tuber" 

/dev_stage="one month post-harvest" 

/lab^host="SOLR" 

/cloneJib="cSTD" 

/note="Vector: pBluescript SK(-); Site_l: EcoRI; Site_^2: 
Xhol; This library targets genes expressed in dormant 
tubers. This library was made from sections of dormant 
tuber, avoiding the buds and epidermis. Tubers were stored 
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for one month post-harvest at 4oC. The tuber was peeled, 
well away from the surface. Then it was chopped into 1-2 
mm cubes and immediately frozen in liquid nitrogen. This 
library is noted as P4 in Tanksley lab notebooks." 

BASE COUNT 226 a 144 c 172 g 210 1 

ORIGIN 

1 gcgaatgtga ttcttcaaag gcaacaaagg ctgacggagg aatttgtgcc tcgtgcagat 
61 ctgcttctgt ttctcatgtc tgctgatcga ccattaactg aaagtgaggt tagttttctg 
121 cgttacactc agcagtggag taagaaggtc atttttgtgc tgaacaagtc tgacatatac 
181 aagaataacg gcgagttgga ggaggccatt gcatttatca aagaaaatac acggaaattg 
241 ctgaatacag aatccgtaac actgtatcca gtatctgcac ggctcgctct tgaatcaaag 
301 ctttctactt ttgatggtgc ccttagtcaa aacaatggga gttcaaataa tgattctcac 
361 tggaaaabca agagcttcta tgagcttgag aagtacttgt ctagcttttt ggatteatcc 
421 acaagtactg gaattgagag aatgaagctg aagcttgaaa ctccaattgc cattgcagaa 
481 caactacttt tagcttgtca aggacttgtg agacaagaat gtcagcaagc caaacaagac 
541 ttgctgtttg ttgaggatct tgtcaacagc gtagaagagt gcacaaagaa gctggaagtt 
601 gatagcattc tgtggaagag gcaggttcta tctctgataa actctgctca agcacgtgtt 
661 gtccggcttg tagagtcaac gttacaactg tcaaatgttg atcttgtcgc tacatatgta 
721 ttcagaagag aaaactctac tcaaatgcca gc 

// 

2: AW760673. sl53dl0.yl Gm.clO...[gi:7692570] 

Links 



LOCUS AW760673 492 bp mRNA linear EST 03-DEC-2001 

DEFINITION sl53dl0.yl Gm-cl027 Glycine max cDNA clone GENOME SYSTEMS 
CLONE ID: 

Gm-cl 027-5036 5^ similar to SW:Y0R6_CALSR P40983 HYPOTHETICAL 

PROTEIN IN XYNA 3'REGION mRNA sequence. 
ACCESSION AW760673 
VERSION AW760673,1 GI:7692570 
KEYWORDS EST. 
SOURCE Glycine max (soybean) 
ORGANISM Glycine max 

Eukaryota; Viridiplantae; Streptophyta; Embryoph34a; Tracheophyta; 

Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids 

; eurosids I; Fabales; Fabaceae; Papilionoideae; Phaseoleae; 

Glycine. 

REFERENCE 1 (bases 1 to 492) 
AUTHORS Shoemaker,R., Keim,P., Vodkin,L., ErpeldingJ., Coryell,V., Khanna 
,A., Bolla,B., Marra,M., Hillier,L., KucabaJ., MartinJ., Beck,C., 
Wylie,T., Underwood,K., Steptoe,M., Theising,B., Allen,M., Bowers 
,Y., Person,B., Swaller,T., Gibbons,M., Pape,D., Harvey,N., Schurk 
,R., Ritter,E-, Kohn,S., Shin,T., Jackson,Y., Cardenas,M., McCann 
,R., Waterston,R. and Wilson,R. 
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TITLE Public Soybean EST Project 
JOURNAL Unpublished 

COMMENT Contact: Shoemaker R/Public Soybean EST Project 
Public Soybean EST Project 
Washington University School of Medicine 
4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108, USA 

Tel: 314 286 1800 
Fax: 314 286 1810 
Email: est@watson.wustl.edu 

This clone is available through: ResGen, Invitrogen Corp. 2130 
South Memorial Parkway Hunts ville, AL 35801 For further information 
call: (800)-533-4363 or contact via email: ccu@resgen.com 
hrseft Length: 2209 Std Error: 0.00- 
High quality sequence stop: 411. 
FEATURES Location/Quahfiers 
source 1..492 



/organism-'Glycine max" 

/mol_type="mRNA" 

/db__xref="taxon:3847" 

/clone="GENOME SYSTEMS CLONE ID: Gm-cl027-5036" 
/tissue_type="cotyIedons of 3- and 7-day-old Williams 
seedlings" 
/lab^host="DH10B" 

/cloneJib="Gm-cl027" 

/note="Vector: pBluescript II SK+; Site_l : EcoRI; Site_^2: 

Xhol; This cDNA library was constructed from mRNA isolated 

from cotyledons of 3- and 7-day-old Williams seedlings 

which were propagated on paper towels with distilled 

water. The cotyledons were flash-frozen in liquid 

nitrogen, then lyophilized for 72 hours. Unequal amounts 

of mRNA was used for cDNA synthesis. Stratagene's cDNA 

Synthexix Kit (catalog number 200401) was used to 

synthesize the cDNA. First- stranded synthesis was 

performed with 5 -methyl dCTP, hence the ligated cDNA was 

hemimethylated. A modification of Stratagene's 

first-strand synthesis primer was used. An anchor 

nucleotide (V=A, C, or G) was added to the 3' end of the 

primer [GAGAGAGAGAGAGAGAGAGAACTAGTCTCGAG(T)18] to anchor 

the primer at the 5' end of the poly(A) tract. After 

second- strand synthesis, the cDNA ends were filled in 

with cloned Pfu DNA, ligated to EcoRI adapters and 

subsequently phosphorylated. The Xhol site within the 

first-strand synthesis primer was then restricted by 

digestion with Xhol; all Xhol sites in the cDNA would be 

protected by their hemimethylated status. The cDNA 

constructs were size-fractionated with a 500 bp cutoff, 
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using GibcoBRL Life Technologies' cDNA Size Fractionation 
column. The column eluent was then ligated into 
Stratagene's pBluescript(tm) 11 XR Predigested vector 
(pBluescript II SK(+) that has been digested with EcoRI 
and Xhol, and phosphorylated by Stratagene). 97% of the 
white and blue colonies appear to contain recombinant 
plasmids with cDNA inserts, based on size (n=30). This 
library was constructed by Dr. Paul Keim and Dr. Virginia 
Coryell." 

BASE COUNT 135 a 91c 108 g 158 1 

ORIGIN 

1 tgttgaatga agctattgaa gctatcaaga gggctgcacc tctgatggag gaggtttcac 
61 ttcttaatga tgcggtttct caaattgatg agccattctt aetggttata gtgggggaat 
121 tcaactctgg taaatctacc gtgattaatg cgcttcttgg agaaagatat ctcaaagagg 
181 gagttgttcc aacaactaat gagatcacat ttttacgata tactgactta gatattgaac 
241 aacaacggtg tgaaaggcat ccagatggcc aatatatttg ctacattcct gctccaattc 
301 ttaaagagat gaccattgtt gatacacctg gaactaatgt gattcttcag aggcagcagc 
361 gtcttacaga ggaatttgta ccccgtgcag atttacttct ttttgtcatt tctgctgatc 
421 gccctttaac tggaagtgag attgcttttc ttcgttattc tcagcagtgg aaaaagaaag 
481 cggtctttgtct 

// 

3: BE353824. EST355167 tomato ...[gi:9291800] 

Links 



LOCUS BE353824 446 bp mRNA linear EST 18-MAY-2001 

DEFINITION EST355167 tomato flower buds, anthesis, Cornell University 
Lycopersicon esculentum cDNA clone cTOD6M4, mRNA sequence. 
ACCESSION BE353824 
VERSION BE353824.1 GI:9291800 
KEYWORDS EST. 

SOURCE Lycopersicon esculentum (tomato) 
ORGANISM Lycopersicon esculentum 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; 
asterids; lamiids; Solanales; Solanaceae; Solanum; Lycopersicon. 
REFERENCE I (bases 1 to 446) 
AUTHORS van der Hoeven,R.S., Bezzeredes,J.L., Matem,A.L., Holt,I.E., Liang 
,F., Hansen,T.S., Craven,M.B., Bowman,C.L., Ronning,C.M., Nierman 
,W., Fraser,C.M., Martin,G.B., Giovannoni,J.J. and Tanksley,S.D. 
TITLE Generation of ESTs from tomato flower tissue, anthesis 
JOURNAL Unpublished 
COMMENT Contact: CUGI 

Clemson University Genomics Institute 
Clemson University 



Fig. 27, continued 5/6 



100 Jordan Hall, Clemson, SC 29634, USA 
Email: http://www.genome.clemson.edu/orders/index.html 
5 prime sequence. 
FEATURES Location/Qualifiers 
source 1..446 

/organism="Lycopersicon esculentum" 

/mol_type-"mRNA" 

/cultivar='TA496" 

/db_xref="taxon:4081" 

/clone="cTOD6M4" 

/tissue_type="flower" 

/dev_stage="anthesis" 

/clone_lib="tomato flower buds, anthesis, Cornell 
University" 

/note="Vector: pBlueScript SKO; Site^l: EcoRl ; Site_2: 
Xhol; supplier: Tanksley; Flower buds and flowers were 
taken from greenhouse plants (4-8 wks old, TA496). They 
were immediately frozen in liquid nitrogen and then 
size-separated while remaining frozen." 

BASE COUNT 119 a 82 c 116 g 129 1 

ORIGIN 

1 gagaccatta agtacaattc tataagcagt cttttgaaaa aagatggact tcattggtga 
61 atccgtctga ccaaattgag ttaggaacaa ctggtgtgct ggatagaaaa tctgaagtta 
121 ccataagtgt catagaggat ttcagtgctg cagctgcttc aaaattgctt gagagagata 
181 ttcgtgaagt gttcttgggt acttttggtg gtcttggagc agctggttta tcagcgtcgc 
241 ttctgacatc tgttcttcaa accacattag aagacctcct tgcacttggc ctttgttctg 
301 ctggcgggtt attagcggtc ttcaacttct catcccggag acagcaagtg gtagataaag 
361 taaagaggac tgctgatggc ctttcacgtg aactcgaaga ggctatgcag aaggagctct 
421 tggagacgac tagtaatgtg gaggac 

// 

4: BI136291. F066P17Y Populus ...[gi:18017219] 

Links 



LOCUS BI136291 521 bp mRNA linear EST 31-DEC-2001 

DEFINITION F066P17Y Populus flower cDNA library Populus balsamifera subsp. 

trichocarpa cDNA, mRNA sequence. 
ACCESSION BI136291 
VERSION BI136291.1 GI:18017219 
KEYWORDS EST. 

SOURCE Populus balsamifera subsp. trichocarpa 
ORGANISM Populus balsamifera subsp. trichocarpa 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots; rosids 
; eurosids I; Malpighiales; Salicaceae; Populus, 
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REFERENCE 1 (bases 1 to 521) 

AUTHORS Hertzberg,M., Aspeborg,H., Erlandsson,R., Bjorkbacka,H., Hiltonen 
,T., KarlssonJ., Teeri,T., Gustafsson,P., Bahlerao,R., Jansson,S., 
Nilsson,0., SundbergjB., Nilsson,P., Uhlen,M., Sandberg,G. and 
LundebergjJ. 
TITLE Gene expression in Populus 
JOURNAL Unpublished 
COMMENT Contact: Erlandsson R 
Department of Biotechnology 
Royal Institute of Technology 
Teknikringen 30, Stockholm S- 10044, Sweden 
Tel: 46 8 790 8287 
- - Fax: 46 8 245452 - 

Email: rikerl@biochem.kth.se. 
FEATURES Location/Qualifiers 
source 1..521 

/organism- 'Populus balsamifera subsp. trichocarpa" 
/mol_type="mRNA" 
/sub_species="trichocarpa" 
/db_xref="taxon:3694" 
/clone_lib="Populus flower cDNA library" 
/note="Organ: flower" 
BASE COUNT 143 a 87 c 135 g 156 1 
ORIGIN 

1 tggtgttgtg ctgtctgatc aagggcttcc tgcccttgtg gcaagaaata tgatgatggg 
61 ttctcgaact gaatcagttg ttctaccttt ggtagccagg attgtgcaga caccatatgc 
121 tgcattaaat gcgtctaatt ctgaaggtgc tgattttctt atatatgttc atggcccaga 
181 ggatgatcct gatgtagaaa tgagccctgg attcgggaat gtgaagatac caatctttgt 
241 cctcaatgct tcacgtgggg aggacacatt gtcggtgggg gcatcaaaat ttctgaaaac 
301 cggtgctagt ggtttagttc tgtcattgga agatttgagg ttatttagcg atgatgcttt 
361 gagtcagatg tttgacactc tgagtgcaac cggtaaaaac tttcaggatg accttgaaag 
421 cttcagtaag ctcaaatcta tggatatgga aaatgatatt catgaaaaaa caacggtggc 
481 aggctttgtt aaactggagg atagagaaaa acagctcata g 



